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INTRODUCTION 
Premalignant  Lesions  and  Cancer 

The  development  of  solid  tumors  is  generally  thought  to  be  a  multistep  process,  whereby 
successive  genetic  events  occur  in  a  normal  cell  to  render  it  increasingly  malignant.  The 
accumulation  of  many  such  changes  is  widely  assumed  to  result  from  an  underlying 
genetic  instability  [1].  In  prostate  cancer  (CaP),  the  genetic  and  epigenetic  phenomena 
that  result  in  cancer  are  not  well  understood,  but  there  is  evidence  that  premalignant 
lesions  exist  in  the  prostate  may  precede  the  development  of  cancer  by  many  years. 
Understanding  the  molecular  mechanisms  underlying  the  early  steps  of  CaP 
tumourigenesis  will  contribute  toward  the  identification  of  molecular  or  recurrent  genetic 
biomarker(s)  crucial  for  signaling  early  transformation  changes  that  lead  to  CaP.  High 
grade  prostatic  intraepithelial  neoplasia  (HPIN)  is  regarded  as  the  most  likely  precursor 
of  prostatic  adenocarcinoma  and  its  histological  detection  in  biopsy  is  considered  a  risk 
factor  for  CaP  development  [2]-[4]  HPIN  can  be  found  in  up  to  70-80%  of  glands  that 
show  presence  of  carcinoma.  It  is  considered  that  in  about  1/3  of  patients  HPIN  will 
progress  to  invasive  carcinoma  within  10  years.  Thus,  early  genetic  events  in  HPIN  are 
most  likely  to  be  key  in  driving  the  tumorigenic  process.  Our  progress  whilst  being 
supported  by  this  grant  allowed  us  to  develop  a  coherent  biological  model  that 
addresses  the  early  genetic  steps  in  the  transition  from  HPIN  to  neoplasia. 


BODY 

Chromosomal  Instability 

Chromosomal  instability  (CIN)  arises  as  a  result  of  aberrations  in  the  cell’s  mitotic 
machinery  and/or  in  the  structural  integrity  of  the  chromosomes  such  as  telomere 
dysfunction  and  is  thought  to  be  a  major  mechanism  underlying  tumorigenesis.  Our 
progress  in  CaP  before  the  start  of  this  work  was  centered  on  the  analysis  of  telomere 
erosion  associated  with  CIN  in  HPIN  (Vukovic  et  al.,  2003)  [4].  The  purpose  of  this 
project  funded  by  Award  DAMD17-03-1-0154  was  to  follow  up  on  this  observation 
and  to  study  gene  expression  profiles  in  the  context  of  genetic  instability,  telomere 
erosion  and  CaP.  Because  of  the  limited  numbers  of  cells  present  in  tiny  HPIN  lesions 
and  microfoci  of  CaP  it  was  necessary  to  optimize  methods  of  representative 
amplification  of  nucleic  acid  following  dissection  (tasks  a  and  b  in  SOW).  In  Hughes  et 
al.  (2004  -  Appendices  5  and  6)  we  optimized  array  comparative  genomic  hybridization 
(aCGH)  analysis  of  DNA  amplified  by  strand  displacement  primed  PCR  from  dissected 
homogeneous  tissue  samples.  In  follow-up  work  discrete,  individual  tumor  microfoci  and 
HPIN  were  dissected  from  radical  prostatectomy  samples  with  unfavorable  outcome. 
Microfoci  of  epithelial  acini  from  the  same  tumor,  separated  by  no  more  than  3mm, 
exhibit  significantly  different  aCGH  profiles  indicating  high  extent  of  CIN  within  CaP 
tumors  and  underscored  the  intrinsic  genotypic  heterogeneity  of  early  lesions  of  CaP. 
Significantly,  losses  of  chromosomes  8p,  13q  and  16q  were  recurrently  observed,  as  were 
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gains  of  chromosome  arm  8q  (with  concomitant  loss  of  8p).  This  latter  observation 
implicates  isochromome  8q  formation.  These  findings  were  consistent  with  existing 
cytogenetic  literature  based  on  metaphase  CGH  studies.  This  study  is  in  preparation  for 
publication  (Appendix  1). 

Damage  Response  Pathways  and  Application  of  Molecular  Profiling  in  the  Analysis 
of  HPIN  and  CaP 

Certain  components  of  DNA  damage  response  pathways,  originally  defined  by  their  roles 
in  the  repair  of  chromosomal  DNA  breaks,  are  found  at  telomeres  and  are  necessary  for 
normal  telomere  maintenance  and  functions.  Breaks  in  chromosomal  DNA  can  be 
repaired  by  nonhomologous  end-joining  (NHEJ)  or  by  homologous  recombination.  Many 
of  the  components  of  DNA  damage  response  needed  for  telomere  maintenance  or  cognate 
downstream  repair  proteins  are  represented  on  pathway-specific  GEArrays  (SuperArray, 
Frederick,  MD,  USA). 

Molecular  profiling  to  interrogate  the  transcriptome  in  the  different  stages  of  prostate 
tumourigenesis  (including  preneoplastic  HPIN)  has  been  hampered  by  the  microscopic 
size  of  these  premalignant  lesions  and  the  infiltrating  nature  of  the  disease  process  within 
the  gland.  Thus  traditional  tissue  RNA  extraction  techniques  are  not  applicable,  as  the 
contaminating  cells  that  constitute  the  majority  of  a  clinical  sample  compromise  the 
resulting  gene  expression  data.  The  use  of  microdissection  and  T7  amplification  with 
pathway-specific  macroarrays  (see  below)  has  provided  a  more  focused  hypothesis- 
driven  approach  to  profiling  subsets  of  genes  that  are  mechanistically  related  to  telomere- 
dependent  genomic  instability  and  preneoplasia. 

Identification  of  Topographical  Variation  of  Differentially  Expressed  Genes  in  CaP 
Associated  with  Telomere  Dysfunction  and/or  CIN 

According  to  the  rationale  for  the  experiments,  the  end  of  the  first  year  of  funding  is  to 
evaluate  the  role  of  damage  response  and  repair  signaling  pathways  associated  with 
telomere  erosion  and  acquisition  of  CIN  in  HPIN.  In  the  present  proposal,  we  have 
chosen  to  evaluate  the  DNA  damage  pathway  in  CaP  insteed  of  HPIN  (as  originally 
proposed),  as  our  previous  study  exhibited  significantly  different  aCGH  profiles 
indicating  high  extent  of  CIN  within  CaP  tumors  and  intrinsic  genotypic  heterogeneity  of 
HPIN  (Appendix  1).  Wedges  of  tissue  parallel  to  the  archived  sample  comprising  ten 
‘cancer  specimens  from  radical  prostatectomy’  were  snapped  frozen  tissue  and  embedded 
in  optimal  cutting  temperature  solution  for  analysis  of  expression  differences  in  DNA 
damage  signaling  and  genome  stability  pathways.  In  this  study  the  tissue  was  sectioned 
into  six  10pm  thick  slides  for  RNA  extraction  and  four  4pm  thick  slides  for 
immunohistochemistry  (IHC),  flanked  by  6pm  thick  slides  for  hematoxilin  and  eosin 
(H&E)  staining.  Microdissection  was  performed  only  on  samples  that  had  relatively  high 
tumor  content  (>75%)  but  lack  the  wide  spread  of  tumor  using  adjacent  H&E-stained 
sections  as  guidance.  RNA  was  extracted  using  the  Absolutely  RNA  micro-isolation  kit 
(Stratagene,  La  Jolla,  CA)  and  then  subjected  to  T7  linear  amplification.  In  the  present 
proposal,  we  have  chosen  to  use  the  Amino  Allyl  MessageAmp  aRNA  amplification  kit 
(Ambion,  Austin,  TX),  rather  than  Clonetech  SMART  PCR  amplification  (as  originally 
proposed)  as  our  pilot  data  demonstrated  superior  yield  and  fidelity.  Amplified  RNA  was 
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labeled  and  hybridized  to  pathway-specific  GEArray  following  the  manufacturer’s 
instructions  (SuperArray,  Frederick,  MD,  USA).  We  used  chemoluminescence-based 
labeling  and  detection  methods  and  the  level  of  expression  of  each  gene  was  quantified 
using  the  GenePix  software  (Axon,  Union  City,  CA).  The  quantified  values  were 
normalized  with  cyplophilin  A  (present  on  each  of  the  GEArray)  and  analyzed  using  the 
GEArrayAnalyzer  software.  Our  data  suggest  that  the  genome  of  CaP  cells  display 
mutation  mechanisms  in  DNA  mismatch  repair  gene  associated  with  genetic  instability. 
Significantly,  RAD51  dysfunction  suggests  a  molecular  mechanism  to  account  for  the 
most  frequent  chromosomal  aberration  in  CaP:  isochromosome  8q  (see  Appendix  1).  It  is 
suggested  that  failure  in  the  fidelity  of  homologous  recombination  within  the  repetitive 
sequences,  that  comprise  the  kinetochore  complex,  could  lead  to  recuurent  loss  of  8p  and 
gain  of  8q  by  rearrangement  of  chromosome  8-specific  alphoid  centromeric  sequences. 
Thus,  the  high  fidelity  process  of  homologous  recombination  can  be  the  major  DNA 
repair  pathway,  which  is  indispensable  for  the  maintenance  of  genetic  stability  (task  e). 
This  study  is  in  preparation  for  publication  (Appendix  2). 

Consequently,  we  anticipate  performing  real-time  RT-PCR  validation  on  several 
important  genes.  For  the  purpose  of  this  validation,  appropriate  primers  for  the  genes  of 
interest  will  be  designed  using  the  guidelines  provided  by  Applied  Biosystems 
(Streetsville,  ON).  The  real-time  RT-PCR  will  be  carried  out  in  two  separate  steps, 
reverse  transcription  using  Superscript  II  RNase  H'  Reverse  Transcriptase  (Invitrogen, 
Burlington,  Ontario,  Canada)  followed  by  quantitative  PCR  using  the  SYBR  Green 
reagents.  Incorporation  of  the  SYBR  Green  dye  into  PCR  products  will  be  monitored  in 
real  time  with  the  ABI  PRISM  7700  sequence  detection  system  (Applied  Biosystems), 
thereby  allowing  determination  of  the  threshold  cycle  (Or),  the  point  at  which 
exponential  amplification  of  PCR  products  begins.  The  Or  values  for  the  housekeeping 
gene  cyclophilin  A  will  be  used  to  calculate  the  relative  transcript  abundance  for  the 
genes  of  interest. 

Immunohistochemistry/Expression  Level  of  Proteins  using  Tissue  Microarrays 

The  experimental  design  of  protein  validations  of  differential  expression  is  investigating 
qualitative  and  quantitative  variation  in  proteins  such  as  TP53,  hTERT,  PTEN  and  other 
proteins  in  HPIN  that  exhibit  a  significant  association  with  telomere  erosion,  acquisition 
of  CIN,  ploidy  change  etc.  Our  rationale  for  this  component  of  the  study  is  that  some  of 
the  established  molecular  pathways  suggested  by  our  expression  studies  may  involve 
alterations  in  protein  phosphorylation  and  kinase  signaling  detectable  at  the  protein  levels 
(as  described  below).  It  is  clear  that  many  such  activating  covalent  modifications  will 
alter  protein  function,  but  they  may  not  be  detectable  by  conventional  analysis  of  RNA 
expression  levels.  We  are  currently  selecting  well-characterized  antibodies  suitable  for 
detailed  investigating  of  pathways  of  interest. 

Fluorescence  in  situ  Hybridization  (FISH)  and  Immunohistochemistry  of  PTEN 
using  Tissue  Microarrays 

Phosphatase  and  tensin  homolog  (PTEN),  is  a  tumor  suppressor  protein  that  inhibits  the 
phosphatidylinosital-3-kinase  (PI3K)/Akt/protein  kinase  B  (PKB)  pathway  and  is  one  of 
the  most  frequently  deregulated  genes  in  a  wide  range  of  cancers.  Although  somatic 
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PTEN  alterations  have  been  reported  in  prostate  cancer,  including  loss  of  heterozygosity 
or  homozygous  deletions,  point  mutations,  and  promoter  hypermethylation,  the 
relationship  between  genomic  alterations  of  PTEN  and  prostatic  neoplasia  remains 
unclear.  Archival  formalin-fixed,  paraffin  embedded  tissues  from  25  radical 
prostatectomy  specimens  were  used  for  tissue  microarray  assembly.  Arrayed  tissues  were 
examined  by  H&E  staining  and  scored  according  to  Gleason  grading  criteria.  Standard 
dual-color  FISH  was  performed  using  commercially-available  DNA  probes  (Vysis  Inc., 
Markham,  ON,  Canada)  for  band  10q23  (PTEN  locus)  and  band  region  lOpll.l-qll.l 
(centromere  of  chromosome  10).  At  least  100  nonoverlapped  intact  interphase  nuclei 
were  scored  in  areas  of  cancer,  HPIN,  benign  glandular  epithelium  and  stroma.  The 
signal  counts  for  areas  of  neoplasia  were  compared  against  those  obtained  from  benign 
and  stromal  cell  nuclei  using  standard  statistical  methods.  Groups  of  6/6  (100%)  cases  of 
benign  hyperplasia  and  16/16  cases  (100%)  of  PIN  were  found  to  be  PTEN  deletion¬ 
negative  while  17/25  (68%)  of  cancer  samples  were  PTEN  deletion-positive  (P  <  0.01). 
In  addition,  a  correlation  was  observed  between  PTEN  deletion  status  and 
immunohistochemical  PTEN  analysis.  These  observations  support  the  hypothesis  that  a 
relative  imbalance  of  PTEN  is  likely  to  be  an  important  factor  in  the  PIN  progression  to 
prostate  cancer.  This  current  study  has  been  prepared  to  publication.  See  illustration  of 
PTEN  expression  analysis  by  IHC  in  Appendix  3  (Figure  1)  and  PTEN  gene  copy 
number  analysis  by  FISH  in  Appendix  4  (Figure  2). 


KEY  RESEARCH  ACCOMPLISHMENTS 

•  Integration  of  telomere  erosion,  genomic  alterations  and  DNA  damage  response 
expression  analysis  to  investigate  mechanisms  of  early  progression  in  prostate  cancer. 
Detection  of  PTEN  regional  deletion  and  implication  that  isochromosome  8q  is  the  most 
frequent  event  in  CaP  and  HPIN  (Appendix  1) 

•  Identification  of  topographical  variation  of  differentially  expressed  genes  in  CaP 
associated  with  telomere  dysfunction  and/or  CIN  and  identification  of  RAD51  as 
potential  mechanism  leading  to  8q  isochromsome  formation  (Appendix  2) 

•  Beheshti  B,  Vukovic  B,  Hughes  S,  Watson  SK,  Ishkanian  AS,  Lam  WL,  and  Squire 
JS.  High  resolution  genomic  analysis  of  prostate  cancer  using  CGH  arrays  and  molecular 
cytogenetic  methods.  95th  Annual  Meeting  of  the  American  Association  for  Cancer 
Research,  Orlando,  FL,  USA,  March  2004  (poster  presentation). 

•  Yoshimoto  M,  Cutz  J-C,  Bayani  J,  Evans  AJ,  Zielenska  M  and  Squire  JA.  Detection 
of  PTEN  Deletion  in  Prostate  Cancer  but  not  Prostatic  Intra-epithelial  Neoplasia  (PIN)  by 
Fluoresecence  In-situ  Hybridization  (FISH)  on  Tissue  Microarrays.  94th  Annual  Meeting 
of  the  United  States  and  Canadian  Academy  of  Pathology,  San  Antonio,  TX,  February- 
March,  2005  (poster  presentation). 

REPORTABLE  OUTCOMES 

Hughes  S,  Yoshimoto  M,  Evans  A,  Beheshti  B,  Squire  JA.  The  use  of  whole  genome 
amplification  to  study  genomic  instability  in  prostate  cancer  (In  preparation,  SEE 
APPENDIX  1) 
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Yoshimoto  M,  Albert  M,  Wong  L,  Hughes  S,  Macgregor  P,  Squire  J,  Evans  A.  Pathway- 
specific  array  analysis  of  DNA  Damage  Signaling  in  Prostate  Cancer  (In  preparation, 
SEE  APPENDIX  2) 

Yoshimoto  M,  Cutz  J-C,  Nuin  PAS,  Bayani  J,  Zielenska  M,  Evans  AJ,  Squire  JA. 
Detection  of  PTEN  Deletion  in  Prostate  Cancer  but  not  Prostatic  Intra-epithelial 
Neoplasia  by  Fluoresecence  in  situ  Hybridization  on  Tissue  Microarrays  (In preparation ) 

Hughes  S,  Lim  G,  Beheshti  B,  Bayani  J,  Marrano  P,  Huang  A,  Squire  JA.  Use  of  whole 
genome  amplification  and  comparative  genomic  hybridisation  to  detect  chromosomal 
copy  number  alterations  in  cell  line  material  and  tumour  tissue.  Cytogenet  Genome  Res. 
2004  (SEE  APPENDIX  5) 

Hughes  S,  Ameson  N,  Done  S,  Squire  JA.  The  use  of  whole  genome  amplification  in  the 
study  of  human  diseases.  Progress  in  Biophysics  &  Molecular  Biology.  2004  ( SEE 
APPENDIX  6) 

CONCLUSIONS 

We  have  determined  patterns  of  genomic  alterations  and  expression  findings  derived 
from  pathways-specific  DNA  damage  signaling  array  (task  d).  We  have  optimized  RNA 
amplification  technique  with  high  fidelity  and  reproducibility  since  our  analysis  is  based 
on  a  few  thousand  cells  microdissected  tissue  (task  c).  We  are  rapidly  developing 
topographical  maps  of  telomere  loss,  genomic  instability  and  concomitant  changes  in 
DNA  repair  and  damage  response  signaling.  In  addition  one  gene  PTEN  that  has  emerged 
from  these  analyses  is  being  validated  at  the  copy  number  and  protein  level  (task  f).  Our 
next  phase  is  to  expand  validation  of  our  expression  findings  at  the  RNA  and  protein 
level,  and  to  determine  which  molecular  pathways  are  the  most  predictive  of  prostate 
cancer  onset  (task  g). 
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APPENDIX  1 

The  use  of  whole  genome  amplification  to  study  genomic  instability  in  prostate  cancer. 
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Abstract 

Prostate  cancer  is  a  multifatorial  disease  with  genetic  and  environmental 
components  involved  in  its  etiology.  The  knowledge  of  genetic  basis  of  CaP  has 
increased  over  the  past  years,  mainly  in  the  pathways  that  underlie  tumorigenesis  and 
growth  and  drug  resistance  for  each  clinical  stage.  Histologically  prostate  cancer  is  a 
heterogeneous  disease  consisting  of  cancerous  and  pre-malignant  lesions  interspersed  by 
normal  non-cancerous  tissue.  This  heterogeneity  can  complicate  genomic  analysis,  as 
when  DNA  is  bulk  extracted  from  such  tissue  the  genetic  information  obtained  represents 
an  average  for  all  of  the  cells  within  the  sample.  To  minimize  this  problem  we  obtained 
DNA  from  individual  regions  of  High-grade  prostatic  intraepithelial  neoplasia  (HPIN) 
and  prostate  cancer  (CaP)  by  laser  capture  microdissection.  The  small  quantities  of  DNA 
material  thus  obtained  were  then  amplified  by  means  of  strand  displacement 
amplification  (SDA)  for  use  in  genomic  DNA  array  comparative  genomic  hybridisation 
(gaCGH).  Recurrent  chromosome  copy  number  abnormalities  (CNAs)  where  observed  in 
both  HPIN  and  CaP.  In  HPIN,  chromosomal  imbalances  involving  chromosome  8  where 
common,  whilst  in  CaP  additional  chromosomal  changes  involving  chromosomes  6,  10, 
13  and  16  where  also  frequently  observed.  An  overall  increase  in  chromosomal  changes 
was  seen  in  CaP  compared  to  HPIN  suggesting  a  universal  breakdown  in  chromosomal 
stability.  The  recurrent  observation  of  8p  loss  and  concomitant  8q  gain  implicates 
isochromosome  8q  as  a  frequent  event  in  HPIN  and  CaP.  The  combination  of  LCM,  SDA 
and  gaCGH  is  ideally  suited  for  the  identification  of  CNAs  from  small  cell  clusters  and 
may  help  to  discover  potential  genomic  markers  for  early  diagnosis  or  identify  the 
location  of  tumour  suppressor  genes  (TSG)  or  oncogenes  previously  unreported  in 
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prostate  tumourigenesis,  as  well  as  to  investigate  the  potential  role  of  chromosomal 
instability  (CIN)  in  preneoplasia  associated  with  progression  in  prostatic  cancer. 
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Introduction 

The  worldwide  occurrence  of  CaP  is  on  the  increase,  with  it  now  having 
overtaken  lung  cancer  as  the  most  commonly  diagnosed  malignancy  of  men  in  the 
Western  World.  In  spite  of  significant  progress  in  its  clinical  management  comparatively 
little  is  known  about  the  aetiology  of  the  disease  and  predicting  outcome  is  still  very 
complicated  for  most  patients.  Histopathologically,  CaP  displays  considerable 
heterogeneity  and  can  contain  several  premalignant  as  well  as  cancerous  foci.  The 
presence  of  HPIN,  a  putative  precursor  of  CaP,  is  currently  considered  the  most  likely 
precursor  to  invasive  prostatic  adenocarcinoma  (van  der  Kwast  et  al.,  1999).  However, 
most  of  these  foci  are  latent  and  rarely  develop  into  clinical  cancer.  Unfortunately  a 
significant  number  (3%)  do  progress  and  can  become  aggressive  and  lethal  (Al-Maghrabi 
et  al.,  2002).  The  major  difficulty  facing  clinicians  is  the  identification  of  those  patients 
presenting  at  an  early  stage  who  are  likely  to  develop  life-threatening  disease.  As  a  result, 
intensive  research  is  currently  underway  to  identify  the  key  alterations  that  may  prove  to 
be  important  for  both  classification  and  prognosis  prediction. 

The  advent  and  development  of  CGH,  initially  described  in  the  early  1990’s 
(Kallioniemi  et  al.,  1992),  has  revolutionised  cytogenetics  and  allowed  for  genome-wide 
screening  of  numerous  cancer  types  and  the  identification  of  genomic  copy  number 
alterations  (CNA)  associated  with  specific  tumour  types,  stages  or  outcomes  (Squire  et 
al.,  2002;  Cleton-Jansen  et  al.,  2004;  Kimura  et  al.,  2004).  When  first  introduced,  CGH 
used  metaphase  chromosomes  targets  to  identify  CNAs,  however,  recent  advance  have 
substituted  these  with  arrayed  DNA  sequences  that  provide  higher  resolution,  1Mb  versus 
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10Mb,  and  greater  ease  of  analysis.  (Hughes  et  al,  2004a)  Whole  genome  scans  of  CaP 
patients  and  cell  lines  have  identified  consistent,  chromosomal  alterations,  which  include 
recurrent  loss  of  chromosomal  regions  from  5p,  6q,  8p,  lOq,  13,  16q  and  17p,  in  addition 
to  gain  on  lq,  3q  7p,  7q,  8q  1  lp,  17q,  Xp  and  Xq  (Wolf  et  al.,  2004;  van  Dekken  et  al., 
2003;  Clark  et  al.,  2003).  In  addition,  genotypic  heterogeneity  and  multiple  foci  have 
been  suggested  to  be  a  likely  mechanism  underlying  clonal  evolution  of  PIN  (Bostwick  et 
al.,  1998).  The  accumulation  of  genetic  changes  that  occurs  during  the  stepwise  evolution 
from  normal  tissue  to  metastasis,  although  likely  due  to  increased  genetic  instability,  may 
indicate  the  chromosomal  locations  of  TSG  or  oncogenes  that  are  important  in 
tumourigenesis. 

i 

In  the  present  study  we  describe  the  use  of  LCM,  SDA  and  gaCGH  to  investigate 
copy  number  changes  occurring  in  cancer  by  the  analysis  of  8  prostatectomy  specimens 
with  HPIN  and  8  with  CaP.  Understanding  of  the  molecular  mechanisms  behind  this 
process  will  enable  identification  of  molecular  and  cytogenetic  biomarker(s)  that  are  able 
to  predict  early  transformation  into  an  aggressive  phenotype,  as  well  as,  fundamental 
insights  concerning  the  regulatory  pathways  of  the  integrity  of  genome  that  may  lead  to 
multistep  field  cancerization  and  preneoplastic  HPIN  and  CaP. 

Materials  and  Methods 

Tissue  accrual 

Fresh  prostate  tissue  was  obtained  from  radical  prostatectomies  performed  at  The 
University  Health  Network  (UHN),  Toronto.  The  tissue  was  embedded  in  Frozen  Section 
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Medium  (Stephens  Scientific,  Riverdale  NJ,  USA).  Prostate  tissue  samples  were  cut  onto 
microscope  slides  (8pm  thickness)  and  stained  using  the  HistoGene  LCM  frozen  section 
staining  kit  (Arcturus,  USA). 

Cohort  selection 

Patient  samples  for  gaCGH  analysis  were  selected  dependent  on  two  criteria:  the 
presence  of  (1)  clearly  identifiable  regions  of  HPIN  or  CaP  using  adjacent  hematoxilin 
and  eosin-stained  sections  as  guidance  and  (2)  DNA  of  high  molecular  weight  provided 
from  15  patient  samples  for  analysis.  DNA  quality  was  assessed  by  DNA  extraction  from 
a  single  tissue  section  and  visualisation  by  gel  electrophoresis. 

Laser  capture  microdissection  and  DNA  Extraction 

A  minimum  of  1000  cells  was  removed  from  foci  of  HPIN  and  CaP  by  laser 
capture  microdissection  (LCM)  using  the  Cell  Robotics  LaserScissors  system  (Cell 
Robotics  Inc.,  USA).  DNA  was  extracted  from  the  dissected  tissue  using  the  QIAamp 
DNA  Micro  Kit  (Qiagen,  USA)  and  the  DNA  concentration  was  determined  using  the 
PicoGreen  dsDNA  Quantitation  kit  (Molecular  Probes  Inc.,  USA).  Both  procedures  were 
performed  following  manufacturers  instructions. 

Strand  displacement  amplification 

DNA  was  amplified  using  the  GenomPhi  Amplification  Kit  (Amersham 
Biosciences,  USA)  according  to  manufacturer’s  instructions  (Hughes  et  al.,  2004b).  This 
method  of  whole  genome  amplification  consistently  produced  in  excess  of  5pg  of  DNA 
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from  a  starting  concentration  of  5-10ng.  Briefly,  amplification  was  carried  out  in  two 
individual  steps.  The  step  1  reaction  mixture  contained  5-1  Ong  (1000  to  2000  cell 
equivalents)  of  DNA  (from  diluted  reference  or  test  DNA)  in  1  p.1  of  sterile  water  and  9  pi 
of  Sample  Buffer.  This  mixture  was  heated  at  95°C  for  3  minutes  and  then  chilled  on  ice. 
The  step  2  reaction  (amplification)  mixture  contained  9pl  of  Reaction  Buffer,  1  pi  of 
Enzyme  Mix  and  the  lOpl  from  Step  1.  The  amplification  reaction  was  incubated  at  30  °C 
for  16-18  hours.  The  enzyme  was  inactivated  by  heating  at  65  °C  for  10  minutes, 
followed  by  cooling  to  4  °C. 

Assessment  of  DNA  quality  and  strand  displacement  amplification 

Five  microlitres  of  each  amplification  reaction  was  electrophoresed  through  a  1% 
agarose  gel  and  stained  with  ethidium  bromide  in  order  to  assess  product  yield  and 
product  length.  All  amplification  products  were  purified  by  phenol-chloroform  extraction 
and  DNA  concentration  and  purity  were  determined  by  measuring  absorbance  at  A260 
and  A280. 

Spectral  Genomic  BAC  arrays 

The  genomic  DNA  arrays  used  in  these  experiments  were  obtained  from  Spectral 
Genomics  Inc.  and  consist  of  2500  unique  BAC  and  PAC  clones,  which  provide  an 
average  genomic  resolution  of  1  Mb. 

CGH  experiments  were  performed  using  the  amplified  patient  DNA  as  the  “test” 
and  amplified  placental  DNA  as  the  “normal”.  In  addition,  two  control  experiments  were 
performed,  one  corresponding  to  amplified  non-neoplastic  prostate  epithelial  cells  versus 
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amplified  placental  DNA  and  one  corresponding  to  a  comparison  of  the  amplified 
placental  DNA  against  itself.  No  CNAs  were  observed  for  either  of  the  control 
experiments.  Though  CNA  information  was  available  for  both  the  X  and  Y  chromosome 
this  data  was  not  included  in  these  analyses. 

DNA  labeling 

Reference  DNA  (4pg)  and  test  DNA  (4pg)  were  first  digested  overnight  at  37°C 
using  10  units  of  Rsal  (Invitrogen)  in  a  lOpl  reaction  and  then  purified  using  the 
QIAquick  PCR  Purification  kit  (Qiagen).  The  digested  DNA  was  labeled  using  the 
protocol  optimized  by  Spectral  Genomics,  with  separate  labeling  reactions  for  Cy3  and 
Cy5  being  set  up  for  both  reference  and  test  DNA.  Briefly,  labeling  reaction  were  set  up 
containing  2pg  of  DNA,  20pl  of  2.5X  random  primer/reaction  buffer  mix  (Invitrogen) 
and  sterile  water  up  to  a  final  volume  of  45jj.1.  The  reaction  mix  was  boiled  for  5  minutes 
prior  to  cooling  on  ice  and  the  addition  of  2.5pl  of  Spectral  labeling  buffer  (Spectral 
Genomics,  Houston,  U.S.A),  1.5pl  of  either  Cy3-dCTP  (ImM,  Applied  Biosystems, 
Foster  City,  U.S.A)  or  Cy5-dCTP  (ImM,  Applied  Biosystems)  and  lp.1  of  Klenow 
Fragment  (BioPrime  labelling  kit,  Invitrogen).  The  reaction  was  mixed  gently  and  then 
incubated  for  2  hours  at  37°C.  Following  incubation  the  reaction  was  stopped  by  the 
addition  of  5pl  0.5M  EDTA  (pH8)  and  heating  at  72°C  for  10  minutes. 

Hybridisation 

The  Cy3  labeled  test  DNA  was  combined  with  the  Cy5  labeled  normal  reference 
DNA  and  vice  versa.  Each  combined  probe  was  mixed  with  45pl  of  Spectral 
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Hybridisation  Buffer  (Spectral  Genomics),  11.3pl  of  5M  NaCl  and  110p.l  of  room 
temperature  isopropanol.  The  samples  were  incubated  in  the  dark  at  room  temperature  for 
10-15  minutes  and  centrifuged  at  16,000  x  g  for  10  minutes  and  the  supernatant 
discarded.  The  pellets  were  then  washed  with  500pl  of  70%  ethanol.  The  supernatant  was 
carefully  removed  and  the  pellets  air-dried  at  room  temperature  in  the  dark.  For 
hybridisation,  the  pellets  were  first  resuspended  in  lOpl  of  sterile  water  prior  to  being 
mixed  with  30pl  of  Spectral  Hybridisation  Buffer  II  (Spectral  Genomics)  by  pipetting. 

The  reconstituted  probes  were  then  incubated  at  72°C  for  1 0  minutes,  placed  on  ice  for  5 

/ 

minutes  and  then  incubate  for  30  minutes  at  37°C.  The  probes  were  hybridised  to  BAC 
arrays,  covered  with  a  22x60mm  coverslip  and  incubated  for  12-16  hours  at  37°C  in  a 
humidified  chamber. 

Washes 

The  wash  buffers,  with  the  exception  of  Wash  I  (2X  SSC,  0.5%  SDS),  were  pre¬ 
warmed  to  50°C.  The  slides  were  gently  dipped  into  and  out  of  Wash  I  until  the  coverslip 
detached  from  the  slide.  The  slides  were  then  washed  once  in  Wash  II  (2X  SSC,  50% 
deionized  Formamide,  pH  7.5)  for  20  minutes,  followed  by  successive  washes  in  Wash 
III  (2X  SSC,  0.1%  NP-40,  pH  7.5)  for  20  minutes  and  Wash  IV  (0.2X  SSC,  pH  7.5)  for 
10  minutes.  All  washes  were  performed  at  50°C,  with  the  exception  of  Wash  I.  The  slides 
were  briefly  submerged  in  distilled  deionized  water  for  5-10  seconds  and  centrifuged  for 
5  minutes  at  85  x  g  to  dry. 
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Data  collection  and  Analysis 

The  slides  were  scanned  using  an  Axon  GenePix  4000A  confocal  Scanner,  each 
fluorescence  signal  was  collected  separately  and  quantified  with  the  GenePix  Pro  3.0 
software  (Axon  Instruments,  U.S.A).  The  data  was  normalised  and  analysed  using 
Normalise  Suite  v2.4  (Beheshti  et  al.,  2003),  regions  of  loss  or  gain  were  determined  as 
those  that  were  2  standard  deviations  above  the  mean  baseline  for  each  separate  sample. 
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Results 

A  summary  of  the  chromosomal  CNAs  detected  by  CGH  for  the  8  HPIN  and  8  CaP 
samples  is  shown  in  Table  1  and  displayed  graphically  in  Figure  1.  Redlines  to  the  left  of 
the  chromosome  indicate  loss  in  the  tumour  and  green  lines  to  the  right  of  the 
chromosome  indicate  gain  in  the  tumour.  In  addition,  group  a)  corresponds  to  HPIN 
samples  and  group  b)  to  CaP  samples.  The  general  pattern  of  loss  or  gain  was  very 
similar  in  the  HPIN  and  CaP  samples,  however  CaP  samples  possessed  significantly  more 
aberrations  than  HPIN  samples  (p<0.0001,  chi  squared  test).  The  average  number  of 
CNAs  for  the  8  HPIN  samples  was  9.25  (range  3  to  32,  median  6.5,  although  7  of  the  8 
samples  were  in  the  range  3  to  8,  median  6).  For  the  8  dissected  CaP  samples  an  average 
of  11.25  (range  5  to  18,  median  11.5)  amplifications  or  deletions  were  observed.  In 
dissected  HPIN,  losses  (<  25%)  were  found  on  8p  (100%),  6q,  lOq,  13q,  16q  (37.5%, 
each),  4q,  1 8q  (25%,  each),  whereas  gains  (<  25%)  were  found  on  8q,  7p  (50%,  each), 
lp,  16p  (37.5%,  each)  and  12q,  20q  (25%).  For  the  8  dissected  cancer  samples  recurrent 
chromosomal  changes,  detected  in  greater  than  25%  of  tumours,  were  losses  on  8p 
(100%),  lOq  (75%),  13q  (62.5%),  6q  (50%),  lp,  16q  (37.5%),  5q,  12p,  18q  (25%,  each) 
and  gains  on  7p,  7q,  8q  (62.5%),  4p,  16p,  19p  (25%) 

Small  consensus  regions  of  consistent  CNA  were  observed  for  both  HPIN  and  CaP 
samples  (Table  2),  in  >25%  of  samples.  For  example,  8q23.1  -  q23.2  was  consistently 
gained  in  HPIN  (100%),  but  within  a  region  (8q21.1 1  -  qter)  that  was  commonly  gained 
in  >37.5%  of  the  HPIN  samples.  Similarly,  8q21.3  -  q23.1  was  consistently  gained  in 
CaP  (100%),  but  within  a  region  (8q21.1 1  -  qter)  that  was  commonly  gained  in  >50%  of 
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the  CaP  samples.  A  list  of  the  genes  present  within  these  consensus  or  commonly  gained 
regions  is  displayed  in  Table  2. 

Discussion 

In  this  study  we  have  used  a  2,400-element  BAC  microarray  with  a  resolution  of  ~1 
Mb  to  study  CNAs  in  a  set  of  16  patient  samples  comprised  of  8  HPIN  cases  and  8  CaP 
cases.  The  likely  reason  for  increase  in  CNAs  in  this  HPIN  lesion  is  its  close  proximity  to 
a  region  of  cancer  within  the  tissue.  Although  study  of  the  tissue  sections  was 
inconclusive,  it  is  possible  that  during  post  operative  tissue  processing,  adjacent  regions 
of  HPIN  and  cancer  may  have  been  separated  with  the  latter  residing  it  in  a  separate 
portion  of  tissue  that  we  did  not  have  access  to.  This  close  proximity  is  important  as 
previous  work  by  our  group  has  indicated  that  foci  of  HPIN  and  cancer  situated 
immediately  adjacent  to  one  another  demonstrate  a  higher  degree  of  telomeres  loss 
compared  to  HPIN  located  away  from  foci  of  CaP  (Vukovic  et  al.,  2003).  In  the  context 
of  CNAs  this  similarity  in  loss  of  telomeres  suggests  a  similar  level  of  chromosomal 
instability  in  the  HPIN  and  cancer  foci,  which  would  explain  the  increase  in  aberrations 
for  CaP  8. 

As  with  other  cancers,  CaP  development  and  progression  is  likely  to  be  the  outcome 
of  a  series  of  stepwise  genetic  changes.  The  accumulation  of  CNAs,  which  occurs  during 
this  process,  although  likely  due  to  increased  genetic  instability,  is  non-random  and  may 
indicate  chromosomal  regions  important  in  tumourgenesis.  Examination  of  our  array 
results  indicates  that  aberrations  involving  parts  or  all  of  lp,  6q,  7p,  7q,  8p,  8q,  lOq,  13q 
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16p  and  16q  are  most  common  and  thus  may  constitute  a  programmed  cycle  of  events 
that  promote  tumour  development,  progression  and  survival. 

A  comparison  of  the  CNAs  present  in  the  HPIN  and  CaP  samples  identified  a 
significant  increase  in  copy  number  for  7q  (p<0.01,  chi  squared  test)  and  a  significantly 
increased  frequency  of  loss  for  lOq  (p<0.01,  chi  squared  test)  and  13q  (pO.OOOl,  chi 
squared  test).  In  genotype/phenotype  correlations,  gain  of  chromosome  7q  (Paiss  et  al., 
2003)  and  loss  of  13q  (Dong  et  al.,  2001)  have  been  associated  with  advancing  tumour 
stage  and  aggressiveness.  Which  is  in  agreement  with  the  results  presented  here. 
However,  gain  of  8q  (Alers  et  al.,  2000)  and  loss  of  16q  (Matsuyama  et  al.,  2003)  have 
also  been  linked  to  tumours  progression  but  our  data  does  not  show  any  significant 
difference  for  these  CNAs  in  our  HPIN  and  CaP  samples  (8q,  p  =  0.2655;  16q,  p  = 
0.3375).  This  would  suggest  that  these  are  likely  to  be  early  events  in  the  tumourigenic 
process.  In  addition,  they  may  also  identify  HPIN  and  CaP  samples  that  are  likely  to 
progress. 

Additional  alterations  that  include  gains  on  12q  (HPIN)  and  loss  of  4q  (HPIN  and 
CaP)  have  also  been  identified.  Whether  the  identification  of  these  regions  will  provide 
additional  insight  into  CaP  progression  is  not  yet  clear.  However,  what  is  apparent  is  that 
the  combination  of  LCM,  WGA  and  CGH  used  here  has  made  such  identifications 
possible.  Further  analysis  using  a  platform  such  as  tissue  microarrays,  which  permit  the 
screening  of  different  disease  stages  from  large  patient  cohorts,  will  help  better  identify 
their  frequency  and  also  their  potential  use  in  diagnosis. 

Previous  reports  have  demonstrated  a  relationship  between  alterations  in 
chromosomal  copy  number  and  alterations  in  gene  expression  (Shayesteh  et  al.,  1999; 
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Collins  et  al. ,  1998).  The  ability  to  distinguish  these  regions  will  point  to  genes,  which 
may  either  directly  (TSG  loss  or  oncogene  gain)  or  indirectly  contribute  to  tumour 
development  and  progression.  As  a  result  CGH  can  be  used  as  a  surrogate  for  gene 
identification.  Candidate  genes  (Table  2),  which  have  previously  been  implicated  in  CaP, 
include  CASP3  (4q),  RAD21,  MYC,  PSCA,  NBS1  (all  8q),  PTEN,  MXI1,  ANXA7  (all 
lOq),  MDM2  (12q),  CDH1,  DERPC,  WWOX  (all  16q)  and  MMP9,  E2F1  (both  20q). 
The  roles  of  MYC  (Cassinelli  et  al.,  2004;  Savinainen  et  al.,  2004;  Bernard  et  al.,  2003), 
PSCA  (Zhigang  et  al.,  2004;  Fuessel  et  al.,  2004;  Reiter  et  al.,  1998),  PTEN  (Dreher  et 
al.,  2004;  Fenic  et  al.,  2004;  Koksal  et  al.,  2004)  and  MDM2  (Chang  et  al.,  2004;  Mu  et 
al.,  2004;  Leite  et  al.,  2001)  have  all  been  well  reported  and  alterations  in  gene  dosage 
correlate  well  with  their  change  in  expression.  However,  other  candidate  genes  have  been 
less  well  studied.  For  example,  EBAG9  whose  increased  expression  in  CaP  is  a  negative 
prognostic  indicator  has  a  potential  role  in  progression  by  enabling  cancer  cells  to  evade 
the  immune  response  (Takahashi  et  al.,  2003).  NBS1,  which  has  been  identified  as  a 
founder  mutation  causing  an  increased  susceptibility  to  prostate  cancer  (Cybulski  et  al., 
2004),  is  involved  in  processing/repair  of  DNA  double  strand  breaks  and  in  cell  cycle 
checkpoints,  thus  its  dysregulation  will  further  contribute  to  chromosomal  instability. 
FKHR,  which  is  a  member  of  the  FOXO  forkhead  transcription  factor  family,  is  thought 
to  play  a  regulatory  role  in  several  cellular  functions  including  cell  proliferation  and 
survival  (Huang  et  al.,  2004).  Loss  of  FKHR  expression,  as  observed  in  CaP  cell  lines,  is 
likely  to  abrogate  this  control  leading  to  tumour  cell  growth.  Though  these  genes  have 
previously  been  implicated  in  CaP  there  are  additional  genes,  including  both  oncogenes 
and  tumour  suppressor  genes,  which  reside  within  all  of  the  affected  regions  that  may 
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play  an  important  role  in  the  aetiology  of  the  disease.  Further  analysis  of  these  other 
candidates,  may  identify  their  potential  as  molecular  targets  for  diagnosis  and  treatment. 

The  process  of  telomere  maintenance  has  previously  been  implicated  in  prostate 
cancer  and  involves  a  series  of  “telomere  associated”  proteins  known  as  the  telosome 
complex  (Liu  et  al.,  2004).  The  data  presented  here  indicates  that  regions  of  the  genome 
that  are  commonly  affected  by  alterations  in  gene  dosage  in  CaP  contain  members  of  this 
complex,  including  TERF1  (8ql3),  TERF2  (16q22.1),  TNKS  (8p23.1),  PTOP  (16q22.1), 
POT1  (7q31.33),  NBS1  (8q21)  and  PINX1  (8p23).  This  finding  indicates  that  loss  or  gain 
of  genetic  material  has  a  potentially  crucial  role  in  the  dysregulation  of  telomere 
maintenance.  The  net  result  of  these  alterations  in  the  telosome  complex  is  the  massive 
chromosomal  instability  characteristic  of  CaP. 

In  summary,  the  combination  of  techniques  used  in  this  study  has  allowed  for  the 
identification  of  consistent  regions  of  copy  number  change,  ranging  from  specific 
cytobands  to  whole  chromosomes,  starting  with  as  little  as  5-1  Ong  of  DNA.  With  the  use 
of  laser  capture  microdissection  and  whole  genome  amplification  we  have  been  to  able 
obtain  pure  HPIN  and  CaP  DNA  and  thus  identify  the  particular  chromosomal  changes 
associated  with  the  two  disease  stages.  In  addition,  we  have  identified  several  lower 
frequency  CNAs,  which  may  not  have  been  detected  when  using  bulk  extracted  DNA  due 
to  the  heterogeneous  nature  of  prostate  tissue.  In  silico  analysis  of  regions  of  copy 
number  change  has  identified  several  interesting  candidate  genes  and  although  some  have 
already  been  extensively  studied  others,  which  have  not,  may  prove  to  be  of  clinical 
importance.  Genetic  screening  strategies  that  combine  FISH  and  immunohistochemistry 
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for  the  detection  of  various  combinations  of  chromosomal  gains  and/or  losses  and  altered 

gene  expression  are  likely  to  be  of  great  use  in  diagnosis  and  prognosis  prediction. 
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Table  1.  Chromosomal  changes  in  microdissected  HPIN  and  CaP  DNA  samples. 


Tumor 

Type 

Loss 

Gain 

CaP  1 

HPIN 

3ql  3 .33-q2 1.1 ;  6p24. 1  -pter;  8p  1 1 . 1  -p23 .2;  1 3ql  4.3- 
q31.2 

Ip21.3-p31.1;  2q24.3-q3 1.1;  8q23.3- 
qter;  Ilpl4.3-pl5.1 

CaP  2 

HPIN 

6ql4.1-ql6.3;  8p23.1-pter;  16q21-qter 

SWHHHi 

CaP  3 

HPIN 

2p23.1-p25.2;  4q22.1-q34.1;  8pll.23-pter 

3q21.2-q24;  12ql4.3-q21.33 

CaP  4 

HPIN 

8p23.1-pter; 

7p22.1-pter;  20qll.22-ql3.ll 

CaP  5 

HPIN 

6ql6.3-q22.1;  8pl  1 .21-pter;  13ql3.1-ql4.3 

7p21.3-p22.1;  8q21.1 1-qter;  16pl2.3- 
pter;  20q  1 1 .23-q  13.2 

CaP  6 

HPIN 

8p2 1.1 -pter;  10q21.3-q23.1;  18q21.32-q22.2; 
21q21.3-qter 

7q32.3-q36.1;  16pl2.3-pl3.13; 
20pll.23-pl2.1 

CaP  7 

HPIN 

8pll.23-pter;  10q23.31-q25.1;  16q22.1-qter 

16pl2.2-pter;  17q24.2-qter 

CaP  8 

HPIN 

Ipl3.1-p22.1;  2ql4.2-q21.1;  2q21.3-q23.3;  3pl2.3- 
ql  1.1;  4q32.3-q35.1;  5pl4.3-q22.1;  6ql4.2-q22.33; 

8ql  1.1-pter;  9q34-qter;  lOql  1.1-ql  1.23;  10q23.2- 
q26.12;  Ilq22.2-q24.3;  13ql3.3-ql4.13;  16ql2.2- 
qter;  1 8ql  1.1-qter 

8ql  1.1-qter; 

CaP  9 

Cancer 

2ql4.2-q21.2;  5ql4.1-q22.1;  6ql6.1-q21;  8p21.3- 
p23.1;  10q22.1-q23.33;  13ql4.1  l-q22.2 

4q21.21-q21.23;  7pl5.3-p21.3; 

7q  11.23 

CaP  10 

Cancer 

Ip21.1-p32.3;  2q33.1-q33.3;  5ql4.1-q21.3;  6ql6.1- 
q22.1;  8pl  1.23-pter;  10q23.31-qter;  13ql2.13-ql4.ll 

4pl3-pl5.1;  4q22.2-q24;  7q31. 1-qter; 

8q21.1 1-qter;  9p22.3-p24.1;  14ql3.1- 
q21.3 

CaP  11 

Cancer 

2q24.2-q3 1.1;  7q22.3-qter;  8pll.22-pter;  10q22.2- 
q23.33;  1 8q  1 1 .  l-q22.3 

CaP  12 

Cancer 

8pll.21-pter;  10q22.2-q23.33;  12pl3.2;  13ql3.3- 
q32.2;  16q24.1-qter;  22ql  1.21-qter 

lq32.1;  5q14.2-ql4.3;  8ql3.3-q21.2 

CaP  13 

Cancer 

Ipl3.1-pl3.2;  lp22.3;  2pl6.3;  6ql2-ql6.1;  8p21.1- 
pter;  10q25.3-q26.1 1 

7p22.1-pter;  7ql  1 .23;  8q22.3-qter; 
16pl2.2-pter;  19p;  19ql3.33-qter 

CaP  14 

Cancer 

6p24.1-pter;  8pl  1.21 -pter;  1 0q23 .3 1  -q23 .32; 
15q22.33-q24.1 

Ip21.3-p32.1;  3q25.33-q27.3;  4pl3- 
pl5.1;  4ql3.1-ql3.3;  7pl5.3-p22.1; 
7q21.11-q21.3;  7q31.1-q31.33; 
8qll.22-qter;  llql4.1-14.3 

CaP  15 

Cancer 

Ipl3.1-pl3.2;  lp22.3;  6ql6.1-ql6.3;  8p21.2-p23.1; 
8ql2-qter;  1 0q2 1 . 1 -q21 .2;  10q23.1-q23.32;  10q25.2- 
q26.1 1;  1  lq22.3-qter;  12pl3.2;  13ql2.3-q21.33; 
15ql3.1-q21.3;  16ql2.2-qter 

7p22.1-pter;  7ql  1 .23;  8p23.2-pter; 
18q21.1;  20ql  1 .21 -ql3. 12 

CaP  16 

Cancer 

3q26.33-q27.3;  8pl  1.22-p23.2;  13ql3.3-qter; 
16q23.1-qter;  18q21.33-q22.1 

Ip35.2-p36.11;  7p22.1-p22.2;  7ql  1 .23; 
15ql  1.2-q22.32;  1 6pl2.2- 1 3. 1 1 ; 

19pl3.1 1-pter 
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Table  2.  Consensus  regions  of  copy  number  gain.  Names  in  bold  indicate  those  genes 
that  have  been  implicated  in  prostate  cancer.  The  *symbol  indicates  the  same  candidate 
genes  from  HPIN. _ _ 


Chromosome 

Region 

Frequency 

Candidate  genes 

HPIN  Gain 

lp 

lp22.3  —  p3 1 . 1 

37.5% 

IGFBP10 

2q 

2q24.3  —  q3 1 . 1 

25% 

TLK1,  ITGA6 

7p 

7p22.1 

37.5% 

NUDT1,  PDGFA 

7pl5.1  -pl5.3 

25% 

GPNMB,  AHR 

8q 

8q2 1.11  -qter 

>37.5% 

PTK2,  RAD21,  MLZE,  WISP1,  NOV,  ENPP2,  MYC, 

PSCA,  PTP4A3,  KCNK9,  TPD52,  MMP16,  NBS1, 

FABP5,  E2F5,  BAALC,  EBAG9 

12q 

12ql4.3-q21.33 

25% 

MDM2 

16p 

16pl2.3  —  pl3.13 

37.5% 

BFAR 

20q 

20ql  1.23 -ql3.il 

25% 

MMP9,  C20ORF1,  SRC,  GHRH,  E2F1,  DNLC2A, 
BASE,  CDC91L1,  WFDC2,  SLPI,  CYP24,  BMP7, 

CSE1L 

HPIN  Loss 

4q 

4q34.1 

25% 

FAT,  MORF4,  CASP3 

6q 

6ql6.3 

37.5% 

SFRP1,  NKX3A,  TRIM35,  REAM,  RB1CC1, 

8p 

8pl  1.23  -  p23.3 

>50% 

PDGFRL,  FGL1,  TNFRSF10B,  LZTS1,  DLC1, 
MTSG1,  TUSC3,  FLJ32642,  MTSS1,  PINX1, 

DEFB1,  CSMD1,  TNKS 

10q 

10q23.1 

25% 

10q23.2  -  q25.1 

25% 

PTEN,  MXI1,  LGI1,  PDCD4,  LAPSER1,  RNF27, 

SUFU,  CASP7,  LIMAB1,  NEURL 

13q 

1 3ql  3. 1  —  ql  4. 1 3 

RFP2,  TSC22,  DBM,  DDX26,  KCNRG 

13ql4.3 

25% 

DLEU2,  DLEU1,  CHC1L,  FAM10A4,  FKHR 

16q 

16q22.1  -  qter 

37.5% 

PTOP,  TERF2,  CDH1,  DERPC,  WWOX,  OKL38, 
CBFA2T3,  CDH13,  WFDC1,  MAF,  FOXF1,  MVD 

18q 

1 8q2 1 .32  -  q22.2 

25% 

PMAIP1 

CaP  Gain 

4p 

4pl3  —  pl5.1 

25% 

UCHL1,  CD38 

7p 

7p22.1 

50% 

* 

7q 

7ql  1.23 

50% 

LIMK1,  CLDN4,  HSPB1 

7q31.1-q31.33 

25% 

NRCAM,  PTPRZ1,  POT1 

8q 

8q2 1.11  -qter 

>50% 

* 

16p 

16pl2.2  -pl3.13 

25% 

* 

19p 

19pl3.ll  -  pter 

25% 

VAV1,  RAB3D,  ELAVL1,  JUN-B,  JUN-D,  EPOR. 

DRIL1,  BSG,  ANGPTL4 

CaP  Loss 

lp 

lp22.3 

37.5% 

lpl3.1  -pl3.2 

25% 

ST7L 

5q 

5ql4.2  -  ql4.3 

25% 

6q 

6ql6.1 

50% 

8p 

8pl  1 .22  —  p23.3 

>50% 

* 

lOq 

10q22.2  -  q23.33 

>37.5% 

UNC5B,  BMPR1A,  BLNK,  PTEN 

10q25.3  —  q26. 1 1 

37.5% 

DMBT1,  TACC2,  WDR11,  FGFR2,  DEC,  BCCIP 

12p 

12pl3.2 

25% 

CD9 

I3q 

13ql3.3-q26.11 

>50% 

BRCA3,  KLF5  * 

16q 

16q24.1  -  qter 

37.5% 

* 

18q 

1 8q2 1 .33  -q22. 1 

25% 

* 

*  indicates  the  same  candidate  genes  from  HPIN.  Names  in  bold  indicate  those  genes  that 
have  been  implicated  in  prostate  cancer. 


Figure  1.  Chromosomal  alterations  observed  for  a)  HPIN  and  b)  CaP  DNA  samples.  Green  bars  to  the  left  of  the  chromosomes 
indicate  regions  of  loss  and  red  bars  to  the  right  of  the  chromosomes  indicate  regions  of  gain. 
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Abstract 

Prostate  cancer  (CaP)  has  become  the  most  commonly  diagnosed  cancer  in  men  in 
North  America.  Understanding  of  the  molecular  mechanisms  behind  the  development  of 
CaP  will  enable  the  identification  of  differential  molecular  biomarker(s),  as  well  as, 
fundamental  insights  concerning  the  regulatory  pathways  of  the  integrity  of  genome  that 
may  lead  to  multistep  field  cancerization.  Human  DNA  damage  signaling  pathway- 
specific  array  of  1 1  amplified  aRNA  from  ten  radical  prostatectomy  specimens  and  one 
normal  control  were  used  for  gene  expression  assembly  to  ATR/ATM  signaling  network 
and  transcriptional  targets  of  DNA  damage  response.  The  expression  profiling  of  96  key 
genes  was  categorized  into  three  functional  groups:  cell  cycle  arrest,  apoptosis,  and 
genomic  stability  and  repair  pathways.  Overall,  two  tumors  clusters  were  obtained  using 
TMEV  software  version  3.1,  whereas  one  of  these  groupings  was  divided  in  two  more 
specific  tumor  clustering.  In  addtion,  36  genes  were  found  to  show  differential  expression 
levels  in  CaP.  MRE11A  gene  was  the  most  down  regulated  gene,  followed  by  the 
members  of  apoptosis  and  cell  cycle  arrest  pathways  (TNFRSF4,  P53AIP1,  BAX,  and 
CDC20,  GSTE1,  respectively).  The  highest  increase  is  seen  predictably  in  RAD51.  In 
total  of  22  genes  showing  increase  in  expression,  12  genes  were  associated  to  genomic 
stability/repair  pathway.  These  data  suggest  that  the  genome  of  CaP  cells  display 
mutational  mechanisms  in  DNA  mismatch  repair  genes  associated  with  genetic 
instability.  Significantly,  RAD51  dysfunction  suggests  a  molecular  mechanism  to 
account  for  the  most  frequent  chromosomal  aberration  in  CaP:  isochromosome  8q.  It  is 
suggested  that  failure  in  the  fidelity  of  homologous  recombination  within  the  repetitive 
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sequences,  that  comprise  the  kinetochore  complex,  could  lead  to  recuurent  loss  of  8p  and 
gain  of  8q  by  rearrangement  of  chromosome  8-specific  alphoid  centromeric  sequences. 
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Introduction 

Prostate  cancer  (CaP)  is  a  major  male  health  challenge  throughout  the  world, 
although  there  is  a  large  variation  in  its  incidence  (Schaid,  2004).  Although  the  causes  of 
this  variation  are  likely  to  be  differences  in  screening  methods,  diet  and  health-related 
behaviors,  clinical  practice  patterns  and  environmental  risk  factors,  the  role  of  genetic 
differences  is  unknown  (De  Marzo  et  al.,  2004;  Schaid,  2004).  Because  of  patient-to- 
patient  heterogeneity  in  the  clinical  behavior  of  this  disease,  prognostic  markers  that  may 
help  tailor  therapeutic  strategies  to  individual  clinical  situations  are  continuously  re¬ 
assessed  (Grasso  et  al.,  2004). 

Regardless  of  the  timing  of  the  morphological  events  differs  from  cell  to  cell, 
recent  studies  of  the  molecular  alteraions  in  CaP  cells  have  begun  to  provide  clues  as  to 
how  prostate  cancer  may  arise  and  progress  (Shaid,  2004).  Oxidative  damage  to  DNA 
and  other  cellular  components  accompanying  chronic  or  recurrent  inflammation  may 
connect  prostate  inflammation  with  cancer,  since  the  genomic  damage  modulates  cancer 
progression  and  the  steps  of  CaP  development  are  androgen  independence,  inhibition  of 
apoptosis,  and  metastasis  (Deutsch  et  al.,  2004).  In  addition,  it  has  long  been  argued  that 
the  tumor  microenvironment  contributes  to  genetic  instability  and  it  is  associated  with 
increased  DNA  damage,  enhanced  mutagenesis  and  functional  impairments  in  DNA 
repairs  pathways  (Reynolds  et  al.,  1996;  Brinda  &  Glazer,  2005).  Despite  the  recent 
advances  in  the  understanding  of  the  environmental,  hormonal,  and  nutritional  origins, 
gene  mutations  reported  in  CaP  appear  quite  heterogeneous,  from  case  to  case,  or  even 
from  lesion  to  lesion  in  a  single  case. 
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The  objetive  of  this  study  was  to  better  understanding  the  early  development  of 
CaP  and  to  identify  significant  differential  genes  by  use  of  pathway-specific  array. 
Understanding  of  the  molecular  mechanisms  behind  the  development  of  CaP  will  enable 
the  identification  of  differential  molecular  biomarker(s),  as  well  as,  fundamental  insights 
concerning  the  regulatory  pathways  of  the  integrity  of  genome  that  may  lead  to  multistep 
field  cancerization. 

Material  and  Methods 

Prostate  tissue  samples  were  collected  through  the  Prostate  Centre  tissue  bank 
(Princess  Margaret  Hospital,  Ontario  Cancer  Institute)  and  the  study  protocol  was 
approved  by  the  University  Health  Network.  Ten  primary  tumors  from  radical 
prostatectomy  and  commercially  available  pooled  RNA  from  1 6  normal  prostatic  tissues 
(Clonetech  Laboratories  Inc.,  Palo  Alto,  CA,  USA)  were  included  in  this  preliminary 
study.  The  tumors  were  pathologically  staged  according  to  the  pathological  tumor  node 
metastasis  (pTNM)  classification  (Ayala  et  al.,  1989)  and  graded  according  to  the 
Gleason  grading  system  (Table  1).  All  tissue  samples  were  snapped  frozen  in  liquid 
nitrogen  and  embedded  in  optimal  cutting  temperature  (OCT)  solution  and  sectioned  into 
six  10pm  thick  slides  for  RNA  extraction,  flanked  by  two  6pm  thick  slides  for 
hematoxilin  and  eosin  (H&E)  staining  and  four  4pm  thick  slides  for 


immunohistochemistry  (IHC). 


6 


Macrodissection  and  RNA  extraction 

Macrodissection  was  performed  only  on  samples  that  had  relatively  high  tumor 
content  (>75%)  but  lack  the  wide  spread  of  tumor  using  adjacent  H&E-stained  sections 
as  guidance.  The  portions  of  interest  of  each  section  were  separated  by  a  scalpel  and 
sterilized  blades,  and  placed  into  a  1.5ml  tube  with  lysis  buffer  containing  a  strong 
protein  denaturant,  guanidine  thiocyanate,  which  aids  in  effective  cell  lysis  and  prevents 
RNase  degradation.  The  extraction  process  was  carried  out  using  the  Absolutely  RNA® 
Microprep  Kit  as  described  by  manufacturer’s  instructions  (Stratagene,  La  Jolla,  CA, 
USA),  and  allowed  quick  purification  of  high  quality  RNA  from  samples  that  contains 
small  number  of  cells  (Dolter  &  Braman,  2001).  RNA  6000  Nano  LabChip®  Kit  (Agilent, 
Mississauga,  ON,  Canada)  and  RiboGreen  (Molecular  Probes,  Eugene,  OR,  USA) 
application  were  employed  to  obtain  accurate  readings  of  quality  and  quantities  of  RNA, 
respectively. 

T7-Ampliflcation 

The  efficient  reproducibility  of  high  yield  of  aRNA  from  200ng  of  mRNA  was 
generated  using  the  Amino  Ally  MessageAmp™  aRNA  kit  (Ambion  Inc.,  Austin,  TX, 
USA),  following  the  procedures  by  Wang  et  al.  (2000).  The  first-strand  cDNA  synthesis 
of  total  RNA  was  primed  with  lpl  of  T7  oligo(dT)  promoter  sequence  by  reverse 
transcription.  Previously  to  cDNA  purification,  the  single-strand  cDNA  was  cleaved  into 
small  fragments  and  converted  to  double-strand  sDNA  by  addition  of  second-strand 
master  mix  containing  lpl  of  RNase  H  and  2 pi  of  DNA  polymerase,  during  2  hours  of 
incubation  at  16°C.  The  amino  allyl  modified  aRNA  was  produced  by  large  scale  in  vitro 
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transcription  reaction  of  purified  cDNA.  In  this  step,  3pil  of  amino  allyl-UTP  was  added 
to  the  aRNA  master  mix  synthesis.  After  a  quick  purification  step,  RNA  6000  Nano 
LabChip®  Kit  (Agilent,  Mississauga,  ON,  Canada)  and  RiboGreen  (Molecular  Probes, 
Eugene,  OR,  USA)'application  were  assessed  to  aRNA  yield  and  quality,  respectively. 

Pathway-specific  array 

The  amplified  aRNA  were  hybridized  in  GEArray®  Q  Series  pathway  array 
containing  112  sites  each  (3  blanks,  3  negative  reference  spots,  10  household  genes,  and 
96  pathway-specific  human  genes)  (SuperArray  Bioscience  Corporation,  Frederick, 
USA).  Human  cDNA  DNA  Damage  Signaling  Pathway  Microarray  (HS-029)  was 
designed  to  assess  the  differential  expression  levels  of  96  prostate  cancer-related  genes 
and  96  key  genes  involved  in  DNA  damage  signaling  pathway.  The  protocol  and  data 
analysis  of  arrays  were  performed  according  to  SuperArray  technical  specifications  and 
the  gene  list  and  array  position  for  expression  profiling  are  available  at  the  web  site 
http://www.superarrav.com. 

In  vitro  convertion  of  2pg  of  amplifed  aRNA  to  biotinylated-dUTP  cDNA  probe 
was  generated  by  applying  MMVL  reverse  transcriptase  (Promega,  Nepean,  ON,  Canada) 
and  biotin- 16-dUTP  (Roche,  Laval,  Quebec,  Canada),  following  the  probe  sysnthesis  for 
chemiluminescent  detection  instructions  from  AmpoLabeling  Linear  Polymerase 
Reaction  (LPR)  Kit  (SuperArray,  Frederick,  USA).  The  biotinylated  cDNA  probes  were 
previously  detectable  at  the  1000-fold  dilution  to  probe  denaturation  and  hybridization  to 
nylon  membranes  spotted  with  gene-specific  cDNA  fragments.  Chemiluminescent 
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analysis  of  membranes  was  subjected  to  quantification  of  binding-signals  of  alkaline 
phosphatase-conjugated  streptavidin  with  CDP-Star  substrate. 

The  array  image  was  recorded  using  Kodak  X-OMAT  LS  film  (Amersham 
Biosciences,  Piscataway,  NJ,  USA)  and  SprintScan  35  Plus  (Polaroid  Canada  Inc., 
Mississauga,  ON,  Canada).  Several  exposure  times  for  capturing  the  membrane  image 
were  archieved  to  produce  the  largest  possible  dynamic  range  in  the  individual  spots.  The 
adjustment  of  exposure  setting  was  based  on  the  minimal  and  maximal  difference 
between  the  blank  spots  with  no  array  signal  and  the  brightest  spots  with  maximal  array 
signal,  since  the  over/underexposure  of  a  number  of  spots  can  affect  the  linearity  of  the 
relationship  between  RNA  amount  and  signal  intensity.  The  image  was  converted  into 
grayscale  and  1 6-bit  tiff  format,  and  the  numerical  data  of  expression  level  of  each  gene 
on  the  GEArrays  was  quantified  using  the  GenePix  software  (Axon,  Union  City,  CA, 
USA)  (Figure  1).  By  allowing  the  GEArrayAnalyzer  software  analysis  (SuperArray, 
Frederick,  MD),  the  raw  data  was  inter-normalized  with  the  normal  prostate  control  and 
intra-normal ized  with  cyclophilin  A  values  present  on  each  of  the  GEArray.  Output 
values  were  also  log2  converted  for  easier  interpretation  and  three  criteria  were  used  to 
determine  differentially  expressed  genes  (Table  2).  (1)  Number  of  values  present  among 
the  4  tumour  samples  for  each  gene  greater  than  75%.  (2)  Relative  standard  deviation  was 
calculated  for  each  gene  across  the  tumour  samples  with  a  1 00%  cut-off.  (3)  Only  genes 
with  average  fold  larger  than  -1,5  and  +1.5  were  considered  for  increase/decrease  in 
expression.  The  hierarchical  clustering  was  performed  using  TIGR  MultiExperiment 
Viewer  (TMEV)  software  version  3.1  (Saeed  et  al.,  2003)  (Figure  2). 
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Results 

Arrayed  nylon  membranes  of  human  DNA  damage  signaling  from  10  radical 
prostatectomy  specimens  (CaP  1  -  CaP  10)  and  one  normal  prostatic  tissue  (11)  were 
used  for  gene  expression  assembly  to  ATR/ATM  signaling  network  and  transcriptional 
targets  od  DNA  damage  response.  The  expression  profiling  of  96  key  genes  was 
categorized  into  three  functional  groups:  cell  cycle  arrest,  apoptosis,  and  genomic 
stability  and  repair  pathways,  as  shown  in  Table  3  (modified  version  of  functional  gene 
grouping  from  SupeArray).  Overall,  two  tumors  clusters  (CaP  5,  6,  7  and  CaP  1,  2,  3,  4, 
8,  9,  10)  were  obtained  using  TMEV  software  version  3.1,  whereas  one  of  these 
groupings  was  divided  in  tthree  more  specific  tumor  clustering  (CaP  1,  3,  4;  CaP  2,  8,  9 
and  CaP  10)  (Figure  1). 

Of  genes  identified  as  expressed  by  the  GEArray Analyzer  (>10%  above 
background  levels),  36  were  found  to  show  differential  expression  levels  in  CaP  as 
defined  by  statistical  normalization  and  ratio  cutoffs  of  1 .5  and  0.5  for  increase/decrease 
in  expression.  The  meiotic  recombination  ( S .  cerevisiae )  1 1  homolog  A  (MRE1 1  A)  gene 
was  the  most  down  regulated  gene  (average  fold  -30.28)  in  CaP  compared  to  normal 
prostate  in  DNA  damage  pathway,  followed  by  the  members  of  apoptosis  and  cell  cycle 
arrest  pathways  (TNFRSF4,  P53AIP1,  BAX,  and  CDC20,  GSTE1,  respectively).  The 
highest  increase  is  seen  predictably  in  RAD51  {S.  cerevisiae)  homolog  (E.  coli  RecA 
homolog)  (RAD  51),  with  a  fold  change  of  18.21  (Table  2).  In  total  of  22  genes  showing 
increase  in  expression,  12  genes  were  associated  to  genomic  stability/repair  pathway. 


Discussion 
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Prostate  cancer  is  a  multifatorial  disease  with  genetic  and  environmental 
components  involved  in  its  etiology  (Nwosu  et  al.,  2001;  Visakorpi,  2003;  Schaid,  2004). 
The  knowledge  of  genetic  basis  of  CaP  has  increased  over  the  past  years,  mainly  in  the 
pathways  that  underlie  tumorigenesis  and  growth  and  drug  resistance  for  each  clinical 
stage  and  specific  histology  (Karan  et  al.,  2002;  Wolter  et  al.,  2002;  Carles  et  al.,  2004). 
Considering  the  general  trends  that  can  be  decipted  from  the  clustering  algorithms,  at 
least  2  tumor-specific  clustering  were  obtained  using  TMEV  software  version  3.1  based 
on  similar  gene  expression  profiles  (Figure  1).  Since  a  correlation  was  not  observed 
between  tumor  hierarchical  clustering,  Gleason  scoring,  pTNM  system  and  serum  PSA 
level,  future  studies  should  include  larger  cohorts  with  long-term  follow-up  data. 

It  has  been  well  established  that  tumor  progression  is  correlated  with  genetic 
instability.  Recent  evidence  suggests  that  the  tumor  microenvironment  itself  constitutes  a 
significant  source  of  such  genetic  instability  by  dysregulation  of  DNA  repair  pathways 
(Bindra  &  Glazer,  2005).  The  genome  of  cancer  cells  displays  mutation  mechanisms  in 
DNA  mismatch  repair  gene  associated  with  microsatellite  instability  (MIN),  in  addition 
to  numerical  and  complex  structural  changes  to  whole  chromosomes  recognized  by 
chromosomal  instability  (CIN)  (Caburet  et  al.,  2002;  DeMarzo  et  al.,  2003;  Storchova 
and  Pellman,  2004).  Although  the  mechanisms  inducing  CIN  are  complex  and  numerous, 
the  most  important  are:  defective  mitotic  checkpoints  and  abnormal  DNA  replication 
events  leading  to  missegregation  or  aneuploidy  (Duensing  and  Munger,  2001;  Caburet  et 
al.,  2002;  Storchova  and  Pellman,  2004)  telomere  dysfunction  (Artandi  et  al.,  2000; 
O’Hagan  et  al.,  2002;  Charames  and  Bapat,  2003;  Vukovic  et  al.,  2003),  and  breakage- 
fusion-bridge  cycles  (Caburet  et  al.,  2002).  Following  to  complexity  of  genetic 
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instability,  it  seems  that  high  fidelity  process  of  homologous  recombination  is  the  major 
DNA  repair  pathway,  which  is  indispensable  for  the  maintenance  of  genetic  stability 
(Thompson  and  Schild,  2001).  Hyperactivity  of  the  homologous  recombination  pathway 
could  also  contribute  to  inappropriate  recombination  between  expansion  repeat  resulting 
in  translocations,  deletions,  or  duplications  (Zhou  et  al.,  2001).  Human  RAD51  (s. 
cerevisiae)  homolog  (E.  coli  RecA  homolog)  (RAD51)  gene  is  a  relatively  small  protein, 
approximately  38  kDa  that  is  likely  to  be  a  target  for  regulator  factors  such  as  p53 
(Baumann  &  West,  1998;  West,  2003).  The  result  of  a  higly  positive  expression  of 
RAD5 1  in  this  study  implies  that  this  gene  can  be  involved  in  cancer  development.  The 
recent  finding  that  linked  increase  homologous  recombination  to  tumor  susceptibility 
supports  this  hypothesis.  Overexpression  of  the  homologous  recombination  and  DNA 
repair  protein  RAD51  has  been  reported  in  immortalized  and  tumor  cells,  providing  a 
correlation  between  elevated  RAD51  protein  levels,  genome  instability,  and  tumor 
progression  (Flygare  et  al.,  2001).  RAD51  dysfunction  suggests  a  molecular  mechanism 
to  account  for  the  most  frequent  chromosomal  aberration  in  CaP:  isochromosome  8q  (Fan 
et  al.,  2004;  Macoska  et  al.,  2004).  It  is  suggested  that  failure  in  the  fidelity  of 
homologous  recombination  within  the  repetitive  sequences,  that  comprise  the  kinetochore 
complex,  could  lead  to  recuurent  loss  of  8p  and  gain  of  8q  by  rearrangement  of 
chromosome  8-specific  alphoid  centromeric  sequences. 

Additionally,  the  result  of  a  negative  expression  of  GADD45a  proposes  the 
involvement  of  cancer  development,  increasing  the  transformation  of  carcinoma.  The 
growth  arrest  and  DNA  damage-inducible  protein  45alpha  (GADD45a)  gene  is 
predominantly  a  nuclear  protein  and  it  has  a  important  role  in  cellular  response  to  DNA 


12 


damage  because  it  is  considered  to  partake  in  controlling  the  cell-cycle  checkpoint, 
apoptosis,  and  DNA  repair  after  DNA  damage  (Carrier  et  al.,  1994,  Daino  et  al.,  2003).  It 
was  shown  that  cells  derived  from  GADD45a-null  mice  exhibited  genomic  instability, 
proto-oncogene  activation,  increase  cellular  proliferation  and  reduced  DNA  repair 
(Hildesheim  et  al.,  2002). 

The  majority  of  neoplasias  develop  as  a  result  of  a  complex  accumulation  of 
genetic  alterations.  The  precise  contribution  played  by  each  mutation  remains  unclear, 
although  many  associations  have  been  made  between  gene  mutation  and  disease  stage.  A 
key  challenge  has  therefore  been  to  link  individual  genetic  changes  with  the  cellular 
mechanisms  underlying  disease.  DNA  double  strand  breaks  represent  the  most 
threatening  lesion  to  the  integrity  of  the  genome  in  cells  exposed  to  ionizing  radiation  and 
radiomimetic  chemicals.  Those  breaks  are  recognized,  signaled  to  cell  cycle  checkpoints 
and  repaired  by  protein  complexes.  Several  distinctive  expressed  genes  involved  in 
ATR/ATM  signaling  network  were  identified:  MRE11A,  CHEkl,  RADI,  ATM  and 
HUS1.  Ataxia-telangiectasia-mutated  (ATM)  is  a  member  of  the  phosphoinositide  3- 
kinase  (PI 3-kinase)  family  that  also  includes  the  catalytic  subunit  of  DNA-dependent 
protein  kinase  (DNA-PKcs)  and  A-T  and  Rad3-related  protein  (ATR)  all  of  which  are 
involved  in  the  recognition  of  damage  in  DNA  (Abraham,  2001).  ATM  is  one  of  a 
growing  number  of  proteins  which  when  mutated  compromise  the  stability  of  the  genome 
and  predispose  to  tumour  development.  The  MRE11A  gene  is  part  of  a  complex, 
Mrel  1/Nbsl/Rad50  (MNR)  complex,  which  acts  as  a  sensor  to  DNA  damage  (Stewart  et 
a.,  1999).  The  MNR  complex  is  associated  with  the  DNA  DSBs  and  remains  at  these  sites 
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until  the  damage  is  repaired  (Lavin  et  al.,  2005).  It  is  suggested  that  pertubation  of  the 
MNR  complex  through  mutated  MRE1 1  predisposes  cancers  (Giannini  et  al.,  2004). 

Conclusions 

Postate  cancer  represents  a  fundamental  research  challenge:  multiple  foci  and 
genetic  heterogeneity  between  different  primary  foci  in  the  same  prostate.  Without 
isolating  and  identifying  common  genetic  features,  it  is  difficult  to  propose  realistic 
mechanistic  models,  and  without  such  models  it  is  impossible  to  propose  changes  to 
present  treatment  regimens.  The  pathway-specific  array  data  suggest  that  the  genome  of 
CaP  cells  display  mutation  mechanisms  in  DNA  mismatch  repair  gene  associated  with 
genetic  instability  and  the  high  fidelity  process  of  homologous  recombination  can  be  the 
major  DNA  repair  pathway,  which  is  indispensable  for  the  maintenance  of  genetic 
stability.  Thus,  a  detailed  genomewide  approach  is  required  to  answer  questions 
concerning  genetic  variations  in  cancer. 
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Table  1.  Clinicopathological  information  from  ten  radical  prostatectomy  specimens 


Sample 

mn 

PSA* 

IjgMMB 

Gleason 

Score 

Gleason 

Grade 

PIN 

Tumor 

content** 

CaP  1 

ACT 

66 

5.00 

pT3aNxMx 

7/10 

3+4 

+ 

>75% 

CaP  2 

ACT 

66 

11.00 

pT2cNxMx 

7/10 

4+3 

+ 

>75% 

CaP  3 

ACT 

55 

19.56 

pT3bNlMx 

9/10 

4+5 

N/A 

>75% 

CaP  4 

ACT 

67 

9.80 

pT2cNxMx 

9/10 

5+4 

- 

>75% 

CaP  5 

ACT 

60 

7.00 

pT2cNxMx 

7/10 

3+4 

+ 

>90% 

CaP  6 

ACT 

63 

14.06 

pT3aNxMx 

7/10 

4+3 

+ 

>90% 

CaP  7 

ACT 

64 

53.00 

pT3bN0Mx 

9/10 

5+4 

- 

>75% 

CaP  8 

ACT 

N/A 

N/A 

N/A 

N/A 

N/A 

N/A 

>75% 

CaP  9 

ACT 

48 

5.70 

pT2cNxMx 

7/10 

3+4 

- 

>75% 

CaP  10 

ACT 

59 

4.90 

pT2cNxMx 

7/10 

3+4 

N/A 

>75% 

PSA:  Prostate-specific  antigen;  pTNM:  pathological  tumor,  nodes,  metastases  system; 
PIN:  Prostatic  intraepithelial  neoplasia;  ACT:  Adenocarcinom,  conventional  type;  N/A: 
not  available;  +:  present;  -:  ausent. 

*  Age  and  PSA  value  at  time  of  prostatectomy;  **  Tumor  percentage  before 


macrodissection. 
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Table  2.  Data  exploration  of  genes  differentially  expressed  between  CaP  and  normal 
prostatic  tissue  obtained  using  MS  Excel®  Microsoft  (version  2003) 


MRE11A 


TNFRSF4 


P53AIP1 


CDC20 


GTSE1 


BAX 


RRM2B 


BAK1 


LRDD 


TNF 


CHEK1 


RPA3 


GADD45A 


FEN1 


CDC25B 


RADI 


ERCC5 


BTG2 


PURA 


ERCC3 


BBC3 


RPL13A 


ATM 


ERCC1 


H2AFL 


E2-EPF 


CKN1 


UNG2 


PIN  1 


BIRC3 


DDB1 


PTPRCAP 


HUS1 


DDB2 


MSH2 


RAD51 


Linear  Standard 
Deviation 

Relative  Standard 
Deviation 

0.10 

316.2% 

0.11 

316.2% 

0.13 

316.2% 

0.16 

212.7% 

0.58 

246.4% 

0.57 

220.9% 

0.43 

158.2% 

0.40 

142.8% 

0.58 

194.6% 

0.69 

212.1% 

0.57 

122.2% 

0.79 

131.0% 

0.74 

113.7% 

0.82 

124.3% 

2.31 

139.2% 

1.80 

104.4% 

2.11 

118.9% 

1.72 

94.4% 

2.38 

121.9% 

2.86 

125.4% 

3.20 

135.7% 

0.53 

20.8% 

2.47 

92.3% 

5.18 

186.7% 

3.32 

118.7% 

4.68 

160.0% 

4.31 

131.2% 

6.01 

178.8% 

6.99 

191.3% 

5.50 

143.2% 

4.46 

95.9% 

10.76 

207.2% 

9.38 

169.9% 

17.46 

301.8% 

29.28 

226.8% 

28.65 

157.4% 

-4.92 

-30.28 

-4.80 

-27.86 

-4.55 

-23.43 

-3.72 

-13.18 

-2.10 

-4.29 

-1.96 

-3.89 

-1.88 

-3.68 

-1.85 

-3.61 

-1.75 

-3.37 

-1.62 

-3.08 

-1.09 

-2.13 

-0.74 

-1.67 

-0.62 

-1.54 

-0.60 

-1.52 

0.73 

1.66 

0.79 

1.73 

0.83 

1.78 

0.86 

1.82 

0.96 

1.95 

1.19 

2.28 

1.24 

2.36 

1.34 

2.53 

1.42 

2.68 

1.47 

2.78 

1.48 

2.8 

1.55 

2.92 

1.72 

3.29 

1.75 

3.36 

1.87 

3.65 

1.94 

3.84 

2.22 

4.65 

2.38 

5.19 

2.47 

5.52 

2.53 

5.79 

3.69 

12.91 

4.19 

18.21 
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Table  3.  Functional  Gene  Grouping:  modified  version  from  SupeArray  Database 


ATM 


PPIA 


Normal  CaP  1 


Figure  1.  GEArray-Q  Series  Human  DNA  Damage  Signaling  Pathway  Gene  Array.  DNA 
damage  response  profiles  for  CaP  1  and  normal  prostatic  sample. 
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KNSL5 

FEN1 

GADD45B 

GADD45A 

DDIT3 

F0X03A 

RPA3 

CDK4 

CHEK2 

53BP1 

CDS1 

XRCC1 

GAPD 

RADS  2 

ERCC5 

BBC3 

H2AFL 

CDC25C 

XPA 

ACTB 

XRCC5 

REPRIMO 

APEX 

CDC25B 

CDC20 

MRE11A 

RRM2B 

BAX 

P53AIP1 

CHEK1 

TNFRSF4 

BAK1 

GTSE1 

LRDD 

TKF 

PPXA 

BIRC2 

PPM1D 

MSH6 

BTG2 

Blank 

RPL13A 

UNG2 

ERCC1 

DDB2 

DDB1 

ATRX 

RADI 

PURA 

ERCC3 

E2-EPF 

PIN1 

CKK1 

PTPRCAP 

ATM 

B1RC3 

HUS1 

MSH2 

RAD51 


Figure  2.  Illustration  of  the  hierarchical  clustering:  96  key  genes  and  10  household  genes 
were  identified  through  statistical  significance  and  fold  change  criteria  in  10  CaP 
samples.  Levels  of  expression  are  represented  on  a  scale  from  the  lowest  expression 
(bright  green)  to  the  highest  expression  (bright  red).  The  color  black  represents  a  zero 


expression. 
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APPENDIX  3 


«• 


Figure  1.  Immunohistochemical  nuclear  and  cytoplasmatic  patterns  obtained  with  the 
commercially  available  PTEN  mouse  monoclonal  antibody  (1:100,  clone  26H9)  in 
Prostate  cancer  TMAs.  A-  The  negative  control  treated  identically  but  without  incubation 
of  the  primary  PTEN  antibody.  B  and  C-  Strong  and  diffuse  immunostaining  positivity 
for  PTEN  in  the  same  benign  glandular  epithelium  and  PIN,  respectively.  D-  Focal  low 
expression  in  areas  of  cancer  in  comparison  with  positivity  in  normal  stromal  cells.  E-  A 
low  magnification  of  TMA  and  F-  H&E  section  show  foci  of  PIN  and  cancer. 
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cijiiiuioiim:  m  niran 

■  * 

PTEN  Control 
Probe 


B 


Figure  2.  Illustration  of  nterphase  FISH  on  prostate  TMAs  using  dual  color  probes  for 
PTEN  locus  (red  signal)  and  centromere  of  chromosome  10  (green  signal). 

A-  Probe  configuration. 

B-  Overall,  PTEN  locus  remained  normal  (2  red  signals  and  2  green  signals)  in  some 
cancer  cells. 

C-  Overall,  PTEN  locus  remained  normal  (2  red  signals  and  2  green  signals)  in  PIN. 

D-  PTEN  locus  deletion  (1  red  signal  and  2  green  signals)  in  cancer  cells. 
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Use  of  Whole  Genome  Amplification  and  Comparative  Genomic  Hybridisation  to 
detect  chromosomal  copy  number  alterations  in  cell  line  material  and  tumour 

tissue. 
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Abstract 


We  have  established  that  whole  genome  amplification  (WGA)  in  conjunction  with 
genomic  DNA  array  comparative  genomic  hybridisation  (gaCGH)  allows  for  the 
identification  of  genome-wide  copy  number  abnormalities  (CNAs)  in  DNA 
extracted  from  both  cell  line  and  patient  material.  To  determine  the  fidelity  and 
reproducibility  of  WGA  to  detect  copy  number  imbalances  using  gaCGH,  well 
characterized  cell  line  genomic  DNA  was  analysed.  The  gaCGH  data  obtained 
from  non-amplified  DNA  and  amplified  DNA  for  the  neuroblastoma  cell  line  NUB7 
and  a  paediatric  medulloblastoma  patient  was  almost  identical.  In  addition,  laser 
capture  microdissection  (LCM)  of  prostate  tumour  cells  and  subsequent  WGA 
allowed  for  the  detection  of  a  number  of  CNAs  that  may  not  have  been  identified 
if  DNA  had  been  extracted  in  bulk  from  heterogeneous  tissue.  The  results 
presented  here  demonstrate  the  use  of  WGA  for  generating  sufficient  DNA  for 
gaCGH  analysis  without  the  introduction  of  significant  sequence  representation 
bias.  The  combination  of  amplification  and  gaCGH  using  DNA  extracted  from 
archival  patient  material  has  the  potential  for  permitting  the  studying  of  DNA  from 
small  cancerous  or  pre-cancerous  foci,  which  may  help  to  identify  potential 
genomic  markers  for  early  diagnosis. 
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Introduction 


Comparative  genomic  hybridisation  (CGH)  has  provided  a  significant 
contribution  to  our  understanding  of  chromosomal  changes  associated  with 
tumour  development  and  progression,  making  it  possible  to  detect  regions  of 
chromosomal  gain  and  loss  in  a  single  experiment  (James  et  at.,  1999). 
Conventional  CGH  uses  metaphase  chromosomes  and  provides  a  typical 
resolution  of  10  Mb  for  a  simple  loss  or  gain  (Kallioniemi  et  al.,  1992)  and 
approximately  2  Mb  for  a  high  copy  number  gain.  Though  this  offers  a  suitable 
starting  point  for  analysis  of  chromosomal  changes,  the  limited  resolution  means 
that  small  focal  changes  may  be  missed.  Recent  advances  that  have  substituted 
the  metaphase  chromosomes  with  well-defined  arrayed  sequences  of  DNA,  have 
allowed  for  a  increase  in  resolution  from  whole  chromosomes,  to  considerably 
shorter  cDNA,  bacterial  artificial  chromosome  (BAC)  or  PI  artificial  chromosome 
(PAC)  clones  (Cai  et  al.,  2002;  Kraus  et  al.  2003).  This  innovation  has  allowed 
for  a  more  accurate  measure  of  chromosomal  changes  thus  greatly  refining  the 
results  produced  by  chromosome  CGH  (Hui  et  al.,  2001;  Wessendorf  et  al. 
2003).  Despite  this  progress,  a  major  drawback  with  the  current  CGH 
methodology  is  its  dependence  on  the  availability  of  large  quantities  of  genomic 
DNA  extracted  from  bulk  tissue.  This  requirement  is  prohibitive  as  for  many 
cancers  those  key  genetic  changes  that  are  of  major  interest  are  likely  to  occur  at 
an  early  phase  in  the  tumourigenetic  process.  These  changes  are  likely  to  be 
restricted  to  a  small  portion  of  pre-malignant  or  malignant  cells  within  the 
specimen  under  examination.  As  a  result,  such  alterations  may  be  masked,  as 
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the  data  obtained  from  CGH  for  bulk  extracted  DNA  represents  an  average  value 
for  all  cells  within  the  specimen.  Even  though  CGH  can  detect  chromosomal 
changes  that  are  present  in  as  little  as  30-50%  of  starting  tumour  cells  (du  Manoir 
et  al. ,  1995),  it  is  likely  that  these  changes  will  correspond  to  original  high  copy 
number  changes,  whereas  low  copy  number  changes  may  be  obscured  by  the 
50-70%  of  contaminating  normal  cells. 

Recent  work  with  prostate  cancer,  which  is  characterized  by  the  presence 
of  pre-malignant  and  malignant  lesions  interspersed  within  normal  non- 
cancerous  tissue,  has  demonstrated  the  utility  of  a  microscope-guided  hollow 
bore  to  obtain  a  relatively  pure  cell  population  that  provides  sufficient  DNA  for 
use  in  array  CGH  (Paris  et  al.,  2003).  However,  in  many  instances,  as  the 
amount  of  patient  tissue  is  a  limiting  factor  and  as  lesions  of  interest  may  only 
consist  of  a  few  hundred  cells,  it  is  possible  that  boring  will  also  collect  a 
significant  number  of  contaminating  normal  cells. 

Previous  studies  have  established  the  efficacy  of  WGA  approaches  to 
obtain  sufficient  amounts  of  DNA  for  array  CGH  (Lage  et  al.,  2003).  However,  to 
date,  analysis  has  been  limited  to  use  of  whole  genome  cDNA  arrays  and 
genomic  DNA  arrays  for  individual  chromosomes.  In  this  report  we  show  that 
WGA  approaches,  such  as  degenerate  oligo-nucleotide  primed  (DOP)-PCR  and 
strand  displacement  amplification  (SDA)  in  conjunction  with  whole  genome 
gaCGH,  allow  for  identification  of  CNAs  from  cell  line  and  patient  DNA  that  show 
a  close  correlation  to  the  results  obtained  for  gaCGH  from  non-amplified  DNA.  In 
combination,  these  results  demonstrate  that  WGA,  specifically  SDA,  allows  for 
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faithful  amplification  of  genomic  DNA.  The  fidelity  of  SDA  when  combined  with 
laser  capture  microdissection  (LCM)  and  gaCGH  can  allow  for  the  detection  of 
CNAs  from  a  small  number  of  starting  cells. 

Materials  and  Methods 

Laser  capture  microdissection 

Fresh-frozen  prostate  tissue  embedded  in  Frozen  Section  Medium  (Stephens 
Scientific,  Riverdale  NJ,  USA)  was  cut  onto  microscope  slides  (5pm  thickness) 
and  stained  using  the  HistoGene  LCM  frozen  section  staining  kit  (Arcturus, 
Carlsbad,  USA).  Regions  of  cancer,  constituting  in  excess  of  1000  cells,  were 
identified  and  obtained  by  laser  capture  microdissection  (LCM)  using  the  Cell 
Robotics  LaserScissors  system  (Cell  Robotics,  Inc.,  Albuquerque,  NM,  USA). 

DNA  Extraction 

Reference  DNA  (from  human  placental  tissue;  46,  XY)  and  test  DNA  (from 
the  NUB7  cell  line,  a  paediatric  medulloblastoma  (90%  tumour)  and  the  captured 
prostate  cancer  cells)  was  prepared  by  proteinase  K  digestion  and  phenol- 
chloroform  extraction  (Molecular  Cloning:  A  laboratory  manual,  Third  Edition). 
DNA  concentration  was  determined  by  measuring  absorbance  at  A260,  with  the 
exception  of  the  DNA  obtained  from  captured  cells  where  the  concentration  was 
determined  using  the  PicoGreen  dsDNA  Quantitation  kit  (Molecular  Probes,  Inc., 
Eugene,  OR,  USA). 
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DOP-PCR  amplification 

DOP-PCR  was  carried  out  in  two  individual  steps.  The  Step  1  reaction 
mixture  contained  5-1  Ong  template  DNA  (from  diluted  reference  or  test  DNA),  Ipl 
of  10X  Thermosequenase  Buffer  (Amersham,  Baie  D’Urfe,  Canada),  IpM  of 
DOP-PCR  primer  (5’-CCG ACTCG AG N N N N N NAT GT GG-3’) ,  0.2mM  dNTPs,  4 
units  of  Thermosequenase  (Amersham),  and  sterile  water  up  to  a  final  volume  of 
10pl.  The  PCR  profile  for  Step  1  was:  denaturation  at  94°C  for  60  seconds, 
annealing  at  30°C  for  60  seconds,  ramping  to  72°C  over  a  3  minute  period 
(3.5°C/15  seconds)  and  extension  at  72°C  for  2  minutes.  The  reaction  was 
carried  out  for  5  cycles  with  a  final  extension  at  72°C  for  10  minutes.  The  Step  2 
reaction  mixture  contained  4pl  of  10X  PCR  reaction  buffer  (Invitrogen,  Burlington, 
Canada),  1.5mM  magnesium  chloride,  0.3pM  of  DOP-PCR  primer,  0.4pM 
dNTPs,  1.25  units  of  Thermus  aquaticus  (Taq)  polymerase  (Invitrogen)  and 
sterile  water  up  to  a  final  volume  of  40pl.  The  step  1  and  2  reaction  mixtures 
were  then  combined.  The  PCR  profile  for  Step  2  was:  denaturation  at  94°C  for 
60  seconds,  annealing  at  55°C  for  60  seconds  and  extension  at  72°C  for  2 
minutes.  The  reaction  was  carried  out  for  30  cycles  with  a  final  extension  at 
72°C  for  10  minutes. 

Strand  displacement  amplification 

Strand  displacement  amplification  was  carried  out  in  two  individual  steps. 
The  step  1  reaction  mixture  contained  5-1  Ong  of  DNA  (from  diluted  reference  or 
test  DNA),  IX  ThermoPol  buffer  (New  England  Biolabs,  Pickering,  Canada),  a 


6 


primer  (modified  random  heptamer  5’  [NITP][NITP]NNNnN  3’)  at  30|iM  (in  final 
reaction  volume  of  20|d)  and  sterile  water  up  to  a  volume  of  6pl.  This  solution 
was  first  heated  at  95  °C  for  2  minutes,  then  left  at  room  temperature  for  10 
minutes  and  finally  chilled  on  ice.  The  step  2  reaction  (amplification)  mixture 
contained  IX  ThermoPol  buffer  (New  England  Biolabs),  dNTPs  at  0.4mM 
(Invitrogen),  0.4%  DMSO  (Sigma-Aldrich,  St.  Louis,  USA),  0.35  units  of  Bst  DNA 
polymerase  (New  England  Biolabs),  5.2ng  of  T4  gene  32  protein  (Amersham) 
and  sterile  water  up  to  a  final  volume  of  14pl.  The  step  1  and  2  reaction  mixtures 
were  combined  and  then  incubated  at  50°C  for  6  hours.  The  enzymes  were 
inactivated  by  heating  at  65  °C  for  10  minutes,  followed  by  cooling  to  4  °C. 

Assessment  of  amplification 

Five  microlitres  of  each  amplification  reaction  was  electrophoresed 
through  a  1%  agarose  gel  and  stained  with  ethidium  bromide  in  order  to  assess 
product  yield  and  product  length.  All  amplification  products  were  purified  by 
phenol-chloroform  extraction  and  DNA  concentration  and  purity  were  determined 
by  measuring  absorbance  at  A260  and  A280. 

Spectral  Genomic  BAC  arrays 

The  genomic  DNA  arrays  used  in  these  experiments  were  obtained  from 
Spectral  Genomics  Inc.  (Houston,  Texas,  USA)  and  consist  of  1403  unique  BAC 
and  PAC  clones,  which  provide  an  average  genomic  resolution  of  2-4  Mb. 
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DNA  labeling 

Array  experiments  were  performed  using  both  amplified  and  non-amplified 
DNA.  Reference  DNA  and  test  DNA  were  first  digested  overnight  at  37°C  using 
10  units  of  Rsal  (Invitrogen)  in  a  10p.l  reaction.  The  digested  DNA  was  labeled 
using  the  protocol  optimized  by  Spectral  Genomics,  with  separate  labeling 
reactions  for  Cy3  and  Cy5  being  set  up  for  both  reference  and  test  DNA.  Briefly, 
labeling  reaction  were  set  up  containing  1-2pg  of  DNA  (genomic  or  amplified), 
20pl  of  2.5X  random  primer/reaction  buffer  mix  (Invitrogen)  and  sterile  water  up 
to  a  final  volume  of  45pl.  The  reaction  mix  was  boiled  for  5  minutes  prior  to 
cooling  on  ice  and  the  addition  of  2.5pl  of  Spectral  labeling  buffer  (Spectral 
Genomics,  Houston,  U.S.A),  1.5^1  of  either  Cy3-dCTP  (ImM,  Applied 
Biosystems,  Foster  City,  U.S.A)  or  Cy5-dCTP  (ImM,  Applied  Biosystems)  and 
Ipl  of  Klenow  Fragment  (BioPrime  labelling  kit,  Invitrogen).  The  reaction  was 
mixed  gently  and  then  incubated  for  2  hours  at  37°C.*  Following  incubation  the 
reaction  was  stopped  by  the  addition  of  5pil  0.5M  EDTA  (pH8)  and  heating  at 
72°C  for  10  minutes. 

Hybridisation 

The  Cy3  labeled  test  DNA  was  combined  with  the  Cy5  labeled  normal 
reference  DNA  and  vice  versa.  Each  combined  probe  was  mixed  with  45pl  of 
Spectral  Hybridisation  Buffer  (Spectral  Genomics),  11.3pl  of  5M  NaCI  and  IIOpI 
of  room  temperature  isopropanol.  The  samples  were  incubated  in  the  dark  at 
room  temperature  for  10-15  minutes  and  centrifuged  at  16,000  x  g  for  10  minutes 
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and  the  supernatant  discarded.  The  pellets  were  then  washed  with  500pl  of  70% 
ethanol.  The  supernatant  was  carefully  removed  and  the  pellets  air-dried  at  room 
temperature  in  the  dark.  For  hybridisation,  the  pellets  were  first  resuspended  in 
1 0jul  of  sterile  water  prior  to  being  mixed  with  30pl  of  Spectral  Hybridisation 
Buffer  II  (Spectral  Genomics)  by  pipetting.  The  reconstituted  probes  were  then 
incubated  at  72°C  for  10  minutes,  placed  on  ice  for  5  minutes  and  then  incubate 
for  30  minutes  at  37°C.  The  probes  were  hybridised  to  BAC  arrays,  covered  with 
a  22x60mm  coverslip  and  incubated  for  12-16  hours  at  37°C  in  a  humified 
chamber. 

Washes 

The  wash  buffers,  with  the  exception  of  Wash  I,  were  pre-warmed  to  40°C. 
The  slides  were  gently  dipped  into  and  out  of  Wash  I  (2X  SSC,  0.5%  SDS)  until 
the  coverslip  detached  from  the  slide.  The  slides  were  then  washed  once  in 
Wash  II  (2X  SSC,  50%  deionized  Formamide,  pH  7.5)  for  20  minutes,  followed 
by  successive  washes  in  Wash  III  (2X  SSC,  0.1%  NP-40,  pH  7.5)  for  20  minutes 
and  Wash  IV  (0.2X  SSC,  pH  7.5)  for  10  minutes.  All  washes  were  performed  at 
40°C,  with  the  exception  of  Wash  I.  The  slides  were  briefly  submerged  in 
distilled  deionized  water  for  5-10  seconds  and  centrifuged  for  3-5  minutes  at  750 
rpm  to  dry. 
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Data  collection  and  Analysis 

The  slides  were  scanned  using  an  Axon  GenePix  4000A  confocal 
scanner,  each  fluorescence  signal  was  collected  separately  and  quantified  with 
the  GenePix  Pro  3.0  software  (Axon  Instruments,  Union  City,  U.S.A).  The  data 
was  normalised  and  analysed  using  Normalise  Suite  vl.63  (Beheshti  et  al., 
2003).  Software  and  complete  data  sets  for  all  samples  can  be  viewed  at 
http://www.utoronto.ca/cancyto  under  the  study  data  section. 

Results 

Whole  genome  amplification  of  DN A  from  tissue  culture  cells  and  tumour  tissue 

Using  DOP-PCR  and  SDA  we  were  able  to  consistently  amplify  in  excess 
of  1  pg  of  DNA  from  a  starting  concentration  of  5-1  Ong.  Though  amplification  was 
possible  from  lower  concentrations  of  starting  material  (less  than  Ing)  the 
product  quality,  as  assessed  by  the  array  results,  was  not  consistent.  The  length 
of  fragments  obtained  using  the  two  WGA  approaches  differed  considerably. 
DOP-PCR  generated  products  ranging  from  0.2kb  to  3kb,  whereas  SDA 
consistently  produced  products  in  excess  of  lOkb.  Variation  in  amount  of  starting 
material  (in  the  range  of  5-1  Ong),  for  either  protocol,  had  little  effect  on  the 
corresponding  product  size  or  smear  intensity.  No  detectable  background 
amplification  was  observed  in  the  negative  controls  for  either  protocol. 
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Genomic  DNA  array  CGH  for  the  NUB7  cell  line 

For  the  NUB7  cell  line  studied  in  this  work  the  presence  or  absence  of 
CNAs,  as  observed  by  gaCGH  using  either  non-amplified  or  amplified  DNA,  was 
consistent  with  previous  findings.  The  gaCGH  results  depicted  in  Figure  1  for 
NUB7  (Dimitroulakos  et.  al.,  1994)  were  obtained  using  non-amplified,  SDA 
amplified  and  DOP-PCR  DNA.  The  gray  shading  denotes  chromosomal  regions 
that  display  CNAs  for  all  three  types  of  DNA  preparation.  The  arrows  indicate 
CNAs  introduced  by  DOP-PCR.  gaCGH  for  NUB7  detected  and  further  refined 
all  of  the  CNAs  that  had  previously  been  identified  using  chromosome  CGH  and 
cDNA  microarrays  (Squire  et  al.,  2003),  these  being  gains  of  1p32.1  -  qter, 
7q32.2  -  qter  and  1 7q1 2  -  qter  and  loss  from  4q11  -  q22.1  and  6q16.1  -  qter 
(Figure  1).  In  addition,  a  loss  of  11  ql  3.3  -  qter  was  also  detected. 

The  gaCGH  results  obtained  for  NUB7  using  DNA  amplified  by  DOP-PCR 
identified  the  entire  set  of  CNAs  listed  above,  however  additional  CNAs  were 
observed  that  were  not  corroborated  by  the  results  obtained  using  the  non- 
amplified  DNA  (Figure  1).  As  a  consequence,  from  this  point  onwards  we 
decided  to  focus  on  the  use  of  SDA,  as  the  results  obtained  for  the  NUB7  cell  line 
showed  100%  of  the  previously  identified  CNAs,  and  importantly  no  additional 
alterations. 

The  control  CGH  experiments  comparing  differentially  labeled  non- 
amplified  and/or  amplified  placental  DNA  did  not  display  any  chromosomal 
imbalances.  In  addition,  those  ideograms  that  do  not  display  any  CNAs  in  non- 
amplified  DNA  are  very  similar  to  those  from  the  amplified  DNA  (Figure  1). 
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These  results  show  that  WGA  introduces  little  or  no  amplification  bias  indicating 
that  the  chromosomal  changes  observed  for  these  samples  are  not  artifacts  of 
the  experimental  procedures. 

Genomic  DNA  array  CGH  from  tissue 

When  gaCGH  was  performed  using  amplified  and  non-amplified  DNA 
extracted  from  a  paediatric  medulloblastoma,  alterations  constituting  loss  of  17p 
and  gain  of  17q  were  observed,  in  addition  to  a  focal  amplification  at  5p12  -  pi 3 
for  both  DNA  types  (Figure  2).  The  gaCGH  results  obtained  for  the  two 
microdissected  prostate  DNA  samples  (CaP11-01  and  CaP32-01)  identified 
several  CNAs  (Table  1,  Figure  3),  which  are  in  keeping  with  previous  result 
observed  for  prostate  cancer  (Cunningham  et  al.,  1996;  Qian  et  al.,  1999). 

Discussion 

Analysis  of  genomic  alterations  in  tumour  tissue  is  often  restricted  by  the 
limited  amount  of  sample  material  from  which  only  small  amounts  of  genomic 
DNA  can  be  obtained.  Whole  genome  amplification  using  DOP-PCR  (Hirose  et 
al.  2001;  Grant  et  al.  2002)  or  SDA  (Lage  et  al.  2003;  Hosono  et  al.  2003)  has 
permitted  the  use  of  advanced  molecular  analyses  that  would  previously  not 
have  been  possible  due  to  the  limited  quantity  of  DNA  available. 

The  merit  of  WGA  is  its  ability  to  faithfully  amplify  the  entire  genome  with 
little  representational  bias.  DOP-PCR  utilises  a  degenerate  oligonucleotide 
primer  in  combination  with  an  initial  low  annealing  temperature  to  achieve  non- 
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specific  amplification  from  multiple  sites  (approximately  106  sites  in  humans) 
within  the  genome  (Telinius  et  al.,  1992).  The  appeal  of  this  approach  is  that  no 
a  priori  knowledge  of  the  DNA  under  investigation  is  required.  Though  DOP- 
PCR  has  been  used  for  many  applications  its  shortcomings  have  been  well 
reported.  It  is  acknowledged  that  DOP-PCR  does  not  provide  either  complete 
coverage  of  all  loci  (Paunio  et  al.,  1996)  or  copy  the  target  DNA  in  its  entirety 
(Chueng  and  Nelson,  1996).  In  addition,  PCR-based  WGA  methods  have  been 
reported  to  cause  4  to  6-fold  amplification  bias  (Dean  et  al.,  2002).  In  this 
context,  the  use  of  DOP-PCR  is  not  an  ideal  approach  for  WGA,  as  the 
amplification  bias  could  produce  false  results. 

Recently,  a  rolling  circle  amplification  method,  originally  developed  for  the 
amplification  of  large  circular  DNA  templates  (Dean  et  al.,  2001),  has  been 
adapted  for  the  amplification  of  genomic  DNA,  and  allows  for  the  synthesis  of 
DNA  strands  in  excess  of  10  kb  in  length.  Similarly  to  DOP-PCR  no  previous 
knowledge  of  the  target  template  is  required.  Using  Bst  polymerase,  T4  gene  32 
protein  and  exonuclease  resistant  random  heptamers,  DNA  is  amplified  in  a  50°C 
reaction.  The  polymerase  is  capable  of  synthesizing  microgram  quantities  of 
DNA  from  as  little  as  5ng  of  starting  material,  making  is  ideally  suited  for  genomic 
analysis  where  starting  DNA  is  limited.  Additionally,  in  contrast  to  DOP-PCR 
SDA  provides  good  genome  coverage  and  does  not  introduce  significant 
amplification  bias  (Lage  et  al.,  2003). 

The  use  of  DOP-PCR  in  the  analysis  of  NUB7  introduced  a  number  of 
additional  CNAs  that  were  not  corroborated  by  the  results  obtained  using  the 
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non-amplified  DNA  (Figure  1).  If  this  bias  was  repeated  in  the  amplification  of 
patient  samples,  where  no  additional  information  was  available,  it  might  lead  to 
incorrect  assignment  or  failure  to  detect  CNAs.  In  contrast  to  DOP-PCR,  SDA 
introduced  no  detectable  bias  in  the  samples  studied,  when  compared  to  non- 
amplified  DNA,  confirming  the  fidelity  of  this  approach. 

To  date,  a  majority  of  the  CGH  array  data  has  been  restricted  to  cDNA 
arrays  (Pollack  et  al.,  1999;  Beheshti  et  al.,  2003),  however  there  are  now  an 
increasing  number  of  articles  describing  CGH  using  other  array  platforms  (Cai  et 
al.,  2002;  Lucito  et  al.,  2003).  Despite  clear  advantages  over  chromosome  CGH, 
cDNA  arrays  exhibit  a  lack  of  consistency  in  hybridisation  and  poor  specific 
signal-to-noise  ratios.  This  attribute  can  cause  an  overall  reduction  in  sensitivity 
and,  as  a  result,  a  decreased  potential  for  detecting  single  copy  changes 
(Hughes  et  al.,  in  press).  In  contrast,  genomic  DNA-based  arrays  provide  a  more 
accurate  approach  for  detecting  single  copy  number  gains  and  losses.  This 
characteristic  has  the  potential  to  allow  for  the  discovery  of  CNAs  that  would 
otherwise  not  have  been  detected  using  either  cDNA  array  CGH  or  metaphase 
CGH.  This  is  observed  here  with  the  detection  of  additional  CNAs  in  the  NUB7 
cell  line.  Despite  the  advantages  of  genomic  DNA-based  arrays,  they  currently 
have  uneven  chromosome  coverage,  resulting  in  many  genomic  regions  being 
under  represented.  This  lack  of  uniformity  was  demonstrated  by  the  failure  to 
detect  the  amplification  of  MYCN  at  2p24,  which  has  previously  been  reported  for 
NUB7  (Squire  et  al.,  2003),  as  a  clone  covering  this  region  is  not  present  on  the 
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array.  The  ongoing  development  of  higher  resolution  arrays  will  likely  address 
this  problem. 

The  data  presented  here  for  the  NUB7  cell  line  and  the  paediatric 
medulloblastoma  demonstrates  that  SDA,  in  conjunction  with  gaCGH,  produces 
results  almost  identical  to  non-amplified  DNA  and  thus  allows  for  the  faithful 
identification  of  chromosomal  alterations.  In  addition,  the  detection  of  CNAs  in 
amplified  DNA  obtained  from  microdissected  prostate  cancer  foci  indicates  that 
SDA  and  gaCGH  may  provide  an  approach  for  studying  chromosomal  changes 
in  cancerous  and  pre-cancerous  lesions  present  within  the  same  tissue  section. 

The  strategy  outlined  above  should  provide  new  insights  into  the  biology 
of  many  tumours  that  contain  multiple  foci,  potentially  allowing  for  the 
identification  of  genomic  regions  of  imbalance  associated  with  various  stages  of 
tumour  progression  and  metastasis.  The  combination  of  these  approaches  have 
the  promise  to  aid  in  the  identification  of  specific  genetic  markers,  which  may 
prove  to  be  important  for  making  prognostic  and/or  therapeutic  decisions. 
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Figure  and  Table  legends 


Figure  1.  The  use  of  (i)  non-amplified,  (ii)  SDA  amplified  and  (iii)  DOP-PCR 
amplified  NUB7  DNA  in  conjunction  with  gaCGH,  allows  for  the  detection  of  a 
gain  of  genetic  material  on  chromosomes  1,  7  and  17  and  a  loss  from 
chromosomes  4,  6  and  11.  The  gray  shading  denotes  chromosomal  regions  that 
display  CNAs  for  all  three  types  of  DNA  preparation.  The  arrows  indicate  CNAs 
introduced  by  DOP-PCR. 

Figure  2.  Chromosomal  alterations  observed  for  a  paediatric  medullooblastoma, 
as  determined  by  gaCGH  from  (i)  non-amplified  and  (ii)  SDA-amplified  DNA.  A 
focal  gain  was  observed  on  5p,  in  addition  to  loss  of  17p  and  gain  of  17q. 
Chromosomes  1  and  10,  which  do  not  possess  any  CNAs,  are  displayed  to 
demonstrate  that  amplification  using  SDA  does  not  introduce  any  aberrant  CNAs. 
The  gray  shading  denotes  chromosomal  regions  that  display  CNAs  for  both  DNA 
preparations. 

Figure  3.  The  chromosomal  alterations  observed  for  the  two  prostate  cancer 
samples,  a)  CaP11-01  and  b)  CaP32-01.  The  DNA  used  for  CGH  was  obtained 
by  laser  capture  microdissection  of  tumour  cells  and  subsequent  SDA. 

Table  1.  Chromosomal  changes  in  microdissected  prostate  cancer  DNA. 
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Table  1 


Prostate  sample 

Gain 

Loss 

CaP11-01 

8q22.t  -  qter 

2p1 6.3  -22.1 

8pter-  pi  2 

CaP32-01 

1 4p1 1 .2  —  ql  1 .2, 
15pter-  q23 

3q26.2  -  qter,  8p 
13q12.2  -  qter, 

16q22  -  qter 
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Abstract 

The  availability  of  large  amounts  of  genomic  DNA  is  of  critical  importance  for  many  of  the  molecular 
biology  assays  used  in  the  analysis  of  human  disease.  However,  since  the  amount  of  patient  tissue  available 
is  often  limited  and  as  particular  foci  of  interest  may  consist  of  only  a  few  hundred  cells,  the  yield  of  DNA  is 
often  insufficient  for  extensive  analysis.  To  address  this  problem,  several  whole  genome  amplification 
(WGA)  methodologies  have  been  developed.  Initial  WGA  approaches  were  based  on  the  polymerase  chain 
reaction  (PCR).  However,  recent  reports  have  described  the  use  of  non-PCR  based  linear  amplification 
protocols  for  WGA.  Using  these  methods,  it  is  possible  to  generate  microgram  quantities  of  DNA  starting 
with  as  little  as  one  nanogram  of  genomic  DNA.  This  review  will  provide  an  overview  of  WGA  approaches 
and  summarise  some  of  the  uses  for  amplified  DNA  in  various  high-throughput  genetic  applications. 

©  2004  Elsevier  Ltd.  All  rights  reserved. 

Keywords:  Whole  genome  amplification  (WGA);  PEP;  DOP-PCR;  SCOMP;  SDA;  TLAD 


1.  Introduction 

Recent  advances  in  the  study  and  understanding  of  human  disease  have  largely  been  made 
possible  by  advances  in  molecular  biology  techniques.  The  sequencing  of  the  human  genome  has 
provided  the  first  human  genetic  blue  print  for  all  23  chromosome  pairs.  Though  the  tools  and  the 
technologies  for  the  analysis  of  this  information  are  still  in  their  infancy,  the  information  they  will 
eventually  provide  will  allow  scientists  and  clinicians  to  identify  and  treat  those  individuals  at  the 
risk  of  cardiovascular  disease,  neuro-psychiatric  disorders,  cancer  and  a  myriad  of  other  genetic 
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disorders,  prior  to  development  of  disease.  The  ability  to  better  diagnose,  monitor  and  manage 
these  disease  processes  is  the  basis  of  much  of  the  research  performed  around  the  world. 

The  starting  point  for  a  large  proportion  of  human  disease  research  is  DNA.  However, 
acquisition  of  this  most  basic  of  requirements  is  often  the  stumbling  block  for  many  researchers. 
Human  disease  research  often  suffers  from  an  inherent  lack  of  sufficient  representative  genomic 
DNA  from  patient  material.  This  can  result  from  the  mode  of  sample  collection  (Zheng  et  al., 
2001),  or  the  requirement  for  pure  tumour  cell  populations  obtained  by  macrodissection  or 
(Chung  et  al.,  2000)  laser  capture  microdissection  (LCM)  (Wang  et  al.,  2001;  Hughes  et  al.,  in 
press).  The  net  result  of  such  factors  is  a  restriction  on  the  type  and  number  of  high-throughput 
genetic  assays  that  can  be  performed.  As  a  result,  researchers  are  often  required  to  implement  a 
candidate  gene  or  region  specific  methodology,  in  contrast  to  a  more  holistic  whole  genome 
approach,  due  to  the  limited  quantities  of  patient  DNA.  Despite  the  success  of  candidate  type 
approaches  (Thakker  et  al.,  1993;  Bates  et  al.,  1997),  it  is  likely  that  many  key  genetic  alterations 
that  could  prove  to  be  important  for  making  prognostic  and/or  therapeutic  decisions  may  go 
undetected  (Simpson  et  al.,  2003).  It  is  therefore  clear  that  some  form  of  whole  genome 
amplification  (WGA)  is  required  in  order  to  provide  an  ample  supply  of  DNA  for  large-scale 
genetic  studies.  For  molecular  analyses  there  are  currently  three  techniques  that  can  be  used: 
PCR-based  approaches,  strand  displacement  amplification  and  T7  based  linear  amplification.  The 
ideal  situation  for  any  such  approach  is  the  generation  of  results  from  the  amplified  DNA  that  are 
indistinguishable  from  the  results  obtained  from  the  original  genomic  DNA.  (Information  for 
each  technique  described  in  this  review  is  listed  in  Table  la  and  b). 

Many  of  the  papers  reporting  the  use  of  WGA  have  applied  it  to  the  analysis  of  human  cancers 
and  though  WGA  has  utility  in  other  fields  such  as  pre-implantation  genetic  diagnosis  (Jiao  et  al., 
2003;  Ao  et  al.,  1998;  Sermon  et  al.,  1996;  Kristjansson  et  al.,  1994)  and  prenatal  diagnosis 
(Martel-Petit  et  al.,  2001;  Sekizawa  et  al.,  1996),  this  review  will  concentrate  on  the  application  of 
WGA  to  the  study  of  human  malignancies.  Below  we  will  summarise  the  evolution  of  WGA 
methods  (both  PCR  and  non-PCR  based)  as  well  as  discuss  the  limitations  of  each  method  as  it 
pertains  to  human  genetic  research. 


2.  PCR-based  approaches 

The  first  PCR-based  genome  amplification  approach,  interspersed  repetitive  sequence  (IRS) 
PCR  (Nelson  et  al.,  1989),  used  primers  designed  to  anneal  to  repetitive  sequences  (Alu  repeats) 
within  the  genome.  The  disadvantages  of  this  approach  were  that  it  relied  on;  (1)  the  repeats  being 
evenly  distributed  throughout  the  genome  and  (2)  the  repeats  being  suitably  positioned  to  allow 
for  the  generation  of  a  PCR  product.  Even  though  Alu  repeats  are  present  throughout  the  human 
genome,  they  have  a  tendency  to  be  clustered  at  intrinsically  unstable  regions  of  the  genome 
(Kolomietz  et  al.,  2002)  and  in  the  G-light  chromosome  bands  (Korenberg  and  Rykowski,  1988). 
Due  to  the  location  of  the  repeats  there  is  a  bias  towards  amplification  of  these  regions.  An 
alternative  approach  to  overcome  these  limitations  was  introduced  by  Ludecke  et  al.  (1989).  Their 
method,  termed  the  linker  adapter  technique  (LA)  PCR,  involved  restriction  digestion  of  the 
DNA  and  ligation  of  adaptors  to  serve  as  priming  sites  for  subsequent  PCR.  These  genome 
amplification  approaches  were  initially  applied  to  the  cloning  of  microdissected  chromosomes 


S.  Hughes  et  al.  I  Progress  in  Biophysics  &  Molecular  Biology  I  (Hit)  III— III 


0  13  ^ 

U  u  1/3 


W  b  <»  .S3  *3  U  (N 

b  J  O  C  H  O 
<  O—-  (N  cS  C-  <N  CT*  CL,  <o 


>  5  Q  =3 
’•*3  4)  2  4> 

2  Q  °  i  .5 


£  §  S'  1  ?5  ©r 


2  c 

-  I  « 

2  ,5  o 
a  EC  o 

<3  w  <N 


©  >N< 

21a 

C/2 

<d  ■  .2 

U(  ~ 

•g  ofl  c 

£.*s  o 


u<  .3  3  s 

g  3  ©IS  g 
-S  S 


g  I  § 


c  b  -a  z 

n  w  c  ^ 


^1  >• 

O  c/3 

<D  S  cj 

H  T3  W 


T3 

<U 

c  .b  ^ 

O  ^  X1 

--  cd  «  g  +5 

rt  o  ^  3 

h  <G  ^  c  a 

y  *C  cL.  J3  cc2 

8  p  Jj  8  S 

O)  a  to  52 


oo 

l\  ^  rt 


<N  <o  f- 
>n  o  ^j- 


OD 

s  S  -s 

.2  <j  g-  2 

1  |fj  | 

IS  20  § 


o 

O 

o 

o 

o 

o 

o  o 

o  o 

(N 

io 

^  cK 

On 

oo  r- 

O  On 

CN 

00 

’ — 1  1 

-h  rf 

g  fi  Si 

o  ^  « 


8  | 
S  20 
09  2^ 

60  I  H 
.g  ■g'-a 
*06 
3  &H  CJ 


o 

o  o 

o 

OC 

© 

lo 

o  o 

o 

■'St 

m 

CM  CM 

CM 

t»  .2 

o  ti 
o  cs 
u 

£  crt 
C  *Hh 

.5  g 

g  3 

b  .y  « 

|  |  ■§ 
<  a  ■£ 

/-^v  <u  o 

eo  s 


x.  s  w 

F  <u 
^  ^ 

Q  pSj  2 

£-h  5 

*  ft  8* 

O  U  CO 

^  ^  c  S 

a,  a,  |  © 

o  o  a  ©  • 

Q  ci  P  2  i 


.c  ^ 

.SP  x 

CM 

y5  T3  W 
^  £  1.1 
5  £  H  I 


it 


S.  Hughes  et  al.  I  Progress  in  Biophysics  &  Molecular  Biology  I  IUH)  Ill-Ill 


5 


<>5 

o* 

© 

•O 

> 

O' 

© 

r- 

> 

On 

O' 

© 

04 

*o 

, 

04 

04 

O' 

°o 

04 

O' 

Ov 

04 

04 

vo 

t\ 

l/N 

CO 

On 

o 

04 

V-) 

O- 

rn 

K 

cd 

— 

> 

© 

cn 

Vo 

*d 

© 

rf 

© 

-H 

© 

© 

© 

— J 

«y-i 

© 

© 

© 

© 

© 

© 

©  © 

© 

© 

©  © 

© 

© 

© 

© 

© 

© 

© 

© 

© 

©  ro 

© 

©  © 

© 

© 

©  © 

© 

© 

© 

© 

O; 

04 

© 

lO 

^  © 

ON 

vd 

ON 

© 

04  VO 

uo 

© 

u-i 

uo 

O^ 

lO 

On 

04 

O' 

rf  04 

© 

VO  oo 

OO 

CO  . 

»o  00 

VO 

VO 

O' 

04 

rn 

— • 

VO  04 

© 

04 

CO  04 

04 

04 

04 

04 

O  Tf 

Tj- 

04  ^ 


u 

CU 


o 

o 


c 

9 

W) 

o 


o 

o 


o  o 
o  o 

<N  — * 


g 

«u 

bo 

o 


p- 

g  ^ 

&  o 

>5  VO 

v  CO 


<D 


bo 

G  X 

o 


o 

o  ,  , 

UO  Ol 

<N  ^  W  ^ 

I-  s g  - 

=■  «  Cl. 

*  >>  ? 

W  c/5  H 


S>  £ 


F-h  £_' 

Q  ^ 


<U 

O  .§ 
£  £ 


■§.§> 

c3 

b  < 
2  £ 
&Q 

"d  Tf 

<  H 


a 


<D 


o  o  o  o 

©  ©  vo  uo 

—  in  (N 


bO 

a 

S'  ® 

O  w 
©  e 
©  .£ 
oo  w 


*U 

<G 


§  „ 
^  o 


© 
VO 
-G  ^ 
.$?  X 
^  04 


<d  2 

3  ft 

S3  fN 


_  "O 

i — ,  c 
i-J  cd 
O  CL 
©  * 
S  w 


o 

q  § 

&3  to 


<I> 

r-.'  G 
O  <D 


ft  o  | 

is  H  h  £ 


©  © 
wo  © 
04  © 
04 


OQ 

W 

£ 


«2  xs 


§ 

© 

© 

m 

04 


© 

© 

04 


<D 

43 

O 

o 

Oh 


©  ©  ©  © 
o  ©  © 

•O  »-H  04 


G 
.  O 
2 


o 


Cl 


•2  D  .a 


.  a  o 
G  4*4  © 


© 

© 

© 


CQ 

W 

55 

>  > — / 

£ 

o 

G 


iaj  fj 

h  h 


u 
.«  u 
aS 


> 

t§ 


a  £ 


O  Q, 
GO  .© 

a}  c 
W>  o 
O  C/5 


%  G 

t^-  3  U 

h  73  h 


'All  prices  are  in  Canadian  dollars. 


6 


S.  Hughes  et  a!.  /  Progress  in  Biophysics  &  Molecular  Biology  I  (WU)  Ill-Ill 


(Saunders  et  al.,  1989)  and  for  the  detection  of  regions  of  DNA  involved  in  DNA/protein 
interaction  (Kinzler  and  Vogelstein,  1989),  not  to  the  process  of  WGA. 

In  1992,  the  first  two  WGA  methods  of  degenerate  oligonucleotide-primed  (DOP)  PCR 
(Telenius  et  al.,  1992)  and  primer  extension  preamplification  (PEP)  (Zhang  et  al.,  1992)  were 
published.  These  techniques  have  allowed  for  the  amplification  of  an  entire  genome-worth  of 
DNA  starting  from  as  little  as  a  single  cell,  which  has  also  been  shown  to  be  possible  for  RNA 
(Eberwine  et  al.,  1992).  Modifications  of  DOP-PCR  and  PEP  (long  products  from  low  DNA 
quantities  (LL)  DOP-PCR,  Kittler  et  al.,  2002;  improved  (I)  PEP,  Dietmaier  et  al.  1999)  have 
allowed  for  increases  in  both  PCR  efficiency  and  DNA  yield.  In  addition,  the  LA-PCR  approach 
(Ludecke  et  al.,  1989)  has  since  been  adapted  for  WGA  (Klein  et  al.,  1999;  Tanabe  et  al.,  2003). 
These  advances  have  increased  the  capacity  for  a  more  comprehensive  genetic  analysis  of  human 
diseases. 

2.1.  Degenerate  oligonucleotide  primed  PCR  (DOP-PCR) 

Degenerate  oligonucleotide  primed  (DOP)  PCR  is  a  well-established,  technically  straightfor¬ 
ward  method  that  is  widely  accepted  as  a  mode  of  WGA.  DOP-PCR-generated  DNA  has  been 
used  for  many  applications  including  metaphase  CGH  (Ottesen  et  al.,  2003;  Beheshti  et  al.,  2002; 
Harada  et  al.,  2002;  Hirose  et  al.,  2001;  Larsen  et  al.,  2001;  Huang  et  al.,  2000;  Verhagen  et  al., 
2000),  array  CGH  (Peng  et  al.,  2003;  Daigo  et  al.,  2001)  single  nucleotide  polymorphism  (SNP) 
genotyping  (Grant  et  al.,  2002;  Barbaux  et  al.,  2001),  microsatellite  genotyping  (Kittler  et  al.  2002; 
Kim  et  al.,  1999;  Cheung  and  Nelson,  1996)  mutation  detection  by  single  stranded  conformational 
polymorphism  (SSCP)  and/or  heteroduplex  analysis  (Barbaux  et  al.,  2001).  DOP-PCR  was  first 
described  by  Telenius  et  al.  (1992)  as  a  method  for  genome  mapping  studies  that  did  not  suffer 
from  the  limitations  of  IRS-PCR  (Ledbetter  et  al.,  1990)  and  was  less  technically  challenging  than 
LA-PCR  (Ludecke  et  al.,  1989;  Saunders  et  al.,  1989).  Since  its  conception,  DOP-PCR  has  been 
adapted  by  several  groups  in  order  to  lower  the  required  amount  of  starting  template  (Hirose 
et  al.,  2001),  to  increase  yield  (Huang  et  al.,  2000),  fidelity  and  fragment  length  (Kittler  et  al., 
2002),  and  to  provide  better  genome  coverage  (Larsen  et  al.,  2001;  Huang  et  al.,  2000; 
Kuukasjarvi  et  al.,  1997).  DOP-PCR  uses  a  partially  degenerate  oligonucletoide  primer  that  binds 
during  several  low- temperature  annealing  cycles  at  many  sites  in  the  genome  (approximately  106 
sites  in  humans).  This  pre-amplification  step  is  followed  by  PCR  amplification  at  a  higher 
temperature  to  allow  only  those  initial  fragments  that  are  tagged  with  the  specific  part  of  the 
primer  sequence  to  be  amplified. 

There  is  considerable  debate  within  reported  studies  regarding  the  possible  limitations  of 
DOP-PCR  and  its  use  in  clinical  applications.  A  comparison  of  the  results  obtained  from  genetic 
analysis  of  DOP-PCR  amplified  DNA  with  those  from  non-amplified  DNA  from  the  same 
specimens  demonstrates  that  amplification  provides  good  genome  representation  (Kim  et  al., 
1999).  In  addition,  Peng  et  al.  state  that  “the  sensitivity  and  specificity  of  CGH  were  scarcely 
influenced  by  preceding  DOP-PCR  amplification”,  but  do  acknowledge  that  there  were 
differences  in  the  quality  of  CGH  microarray  results.  These  have  been  attributed  to  different 
labelling  methods  as  DOP-PCR  labelling  tended  to  give  false  positive  results,  while  nick 
translation  labelling  of  DOP-PCR  products  had  limitations  in  detecting  losses  (Peng  et  al.,  2003). 
Though  its  utility  in  SNP  analysis  has  been  proven  (Jordan  et  al.,  2002),  Grant  et  al.  note  that 
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data  clusters  in  SNP  genotyping  are  not  as  tight,  resulting  in  an  increase  in  ambiguous  calls  for 
DOP-PCR  amplified  templates  (error  rate  0.7%).  However,  they  believe  that  these  errors 
(including  unequal  amplification  and  generation  of  extra  alleles)  could  be  attributed  to  the 
original  quality  of  the  DNA  sample  (Grant  et  al.,  2002). 

The  amount  and  quality  of  the  starting  material  used  for  DOP-PCR  is  an  issue  that  has  been 
addressed  by  several  groups  using  this  technique.  In  general  it  is  agreed  that  starting  with  less  than 
1  ng  of  genomic  DNA  severely  decreases  the  fidelity  of  the  resulting  experiment  and  increases  the 
likelihood  of  allele  dropout  (Kittler  et  al.,  2002)  although  some  groups  have  successfully  used  less 
starting  material  (Huang  et  al.,  2000).  DNA  extracted  from  fixed  tissue  has  also  been  successfully 
DOP-amplified  and  used  for  subsequent  analysis.  Unfortunately  issues  regarding  preferential 
amplification  of  alleles  in  DNA  from  fixed  tissues  have  not  yet  been  adequately  addressed. 


2.2.  Primer  extension  preamplification  (PEP) 

PEP  was  first  described  by  Zhang  et  al.  (1992)  who  utilised  PEP  to  amplify  DNA  from  single 
sperm  nuclei  prior  to  analysis  of  specific  genes.  The  original  PEP  protocol  involves  a  50  cycle  PCR 
program  using  Taq  polymerase  and  a  15  base  random  oligonucleotide  primer.  However,  in 
contrast  to  a  standard  PCR,  following  denaturation  the  primers  are  allowed  to  anneal  at  a  low 
stringency  temperature  (37°C),  which  is  then  gradually  ramped  (10s/°C)  up  to  55°C  prior  to  a 
4  min  elongation  step  (55°C).  The  choice  to  use  random  primers  was  based  on  the  assumption  that 
one  primer  extension  product  is  just  as  likely  to  undergo  primer  extension  during  successive  cycles 
as  any  other  product.  Through  this  work  it  was  determined  that  via  PEP  approximately  78%  of 
the  genome  from  a  single  haploid  cell  can  be  copied  at  least  30  times  (Zhang  et  al.,  1992).  It  has 
been  proposed  that  not  all  DNA  sequences  are  capable  of  being  amplified  by  PCR,  possibly  due 
to  their  repetitive  nature  or  their  sequence  composition.  Therefore  the  22%  of  the  genome  that  is 
not  amplified  by  PEP  could  correspond  to  such  regions.  Over  time,  the  PEP  protocol  has  been 
optimised  by  utilisation  of  proof  reading  enzymes  and  altered  PCR  cycling  conditions  (improved 
(I)  PEP,  Dietmaier  et  al.,  1999).  These  adaptations  have  allowed  for  greater  WGA  efficiency 
(Dietmaier  et  al.,  1999;  Heinmoller  et  al.,  2002)  from  single  cells  and  small  cell  clusters. 

Since  its  conception,  PEP  has  been  used  for  many  of  the  same  applications  as  DOP-PCR  and 
has  allowed  for  multiple  genetic  analyses.  However,  compared  to  DOP-PCR,  the  results  obtained 
from  PEP  have  been  reported  to  be  more  reliable  in  certain  circumstances.  The  use  of  restriction 
fragment  length  polymorphism  (RFLP)  PCR  to  compare  DOP-PCR,  PEP  and  I-PEP, 
demonstrated  that  I-PEP  has  an  increased  efficiency  for  amplification  of  DNA  from  both 
individual  cells  (40%  efficiency  vs.  15%  for  standard  PEP  and  3%  for  DOP-PCR)  as  well  as  up  to 
five  cells  (100%  efficiency  vs.  33%  for  standard  PEP  and  20%  for  DOP-PCR).  In  addition,  WGA 
by  I-PEP  has  allowed  for  100%  sequencing  accuracy  of  greater  than  4000  sequenced  base  pairs 
from  Ki-ras  (Dietmaier  et  al.,  1999).  A  number  of  reports  have  highlighted  the  success  of  PEP 
using  source  DNA  from  both  ethanol  fixed  paraffin  embedded  (EFPE)  tissue  (Heinmoller  et  al., 
2002)  and  formalin  fixed  paraffin  embedded  (FFPE)  tissue  (Bataille  et  al.,  2003;  Heinmoller  et  al., 
2003;  Wang  et  al.,  2001;  Heinmoller  et  al.,  2000;  Dietmaier  et  al.,  1999;  Duddy  et  al.,  1998).  This  is 
of  particular  interest  to  those  researchers  who  are  limited  to  the  use  of  archival  tissues  fixed  in  this 
manner.  Though  microsatellite  analysis  can  be  performed  on  the  DNA  amplified  from  a  single  cell 
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from  fixed  tissue  (Heinmoller  et  al.,  2002),  more  reliable  results  are  obtained  from  multiple  cells 
(Dietmaier  et  al.,  1999),  or  fresh  tissue  (Zheng  et  al.,  2001). 

One  of  the  previously  mentioned  problems  in  the  study  of  human  cancers  is  the  limitation  on 
the  number  of  tests  that  can  be  carried  out  on  restricted  quantities  of  DNA  or  individual  cells. 
Advances  in  PEP-PCR  have  allowed  for  the  most  comprehensive  allelotype  analysis  in  pituitary 
tumours  so  far  performed  (Simpson  et  al.,  2003).  Analysis  of  50  FFPE  sporadic  non-functioning 
pituitary  tumours  (25  non-invasive,  25  invasive)  and  50  FFPE  somatotrophinomas  (25  non- 
invasive,  25  invasive)  determined  that  LOH  is  more  frequent  in  invasive  tumours  than  in  their 
non-invasive  counterparts  (Simpson  et  al.,  2003).  Not  only  is  PEP  suitable  for  the  analysis  of 
malignant  cells,  it  can  also  be  used  for  the  study  of  microdissected  premalignant  cells,  such  as 
pancreatic  intraductal  lesions,  which  can  possess  many  of  the  important  genetic  changes 
associated  with  pancreatic  tumourigenesis  (Heinmoller  et  al.,  2003). 

Although  microsatellite  analysis  of  PEP  amplified  DNA  has  permitted  a  refinement  of  LOH 
mapping  in  several  tumours  (ovarian  cancer,  Wang  et  al.,  2001;  cervical  intraepithelial  neoplasia, 
Chung  et  al.,  2000;  bladder  cancer  Simoneau  et  al.,  1999;  head  and  neck  squamous  lesions, 
Coombes  et  al.,  1998)  and  allowed  for  the  identification  of  chromosomal  regions  harbouring 
tumour  suppressor  genes  (TSG),  there  are  still  specific  limitations  (see,  Section  3).  Like  DOP- 
PCR,  however,  PEP  is  a  relatively  straightforward  protocol  that  is  easily  implemented  in  most 
laboratory  settings. 

2.3.  Ligation-mediated  PCR  (LMP) 

Genetic  analysis  of  small  DNA  samples  using  genomic  representations  (Lucito  et  al.,  1998)  is  a 
method  of  WGA  that  has  not  been  widely  reported  in  the  literature,  likely  due  to  the  technical 
difficulties  involved.  However,  this  method,  which  involves  ligating  an  adapter  sequence  on  to  a 
“representation”  of  DNA  to  facilitate  subsequent  PCR  amplification,  is  showing  new  promise  in 
the  analysis  of  DNA  samples. 

Single  Cell  Comparative  Genomic  Hybridization  (SCOMP)  is  a  PCR-based  strategy  for  global 
amplification  of  genomic  DNA  from  a  single  cell  (Klein  et  al.,  1999).  In  principle  it  is  very  similar 
to  the  LA-PCR  method  described  by  Ludecke  et  al.  (1989)  and  Saunders  et  al.  (1989).  For 
SCOMP,  genomic  DNA  (instead  of  individual  chromosome  bands)  is  digested  with  the  restriction 
endonuclease,  Msel,  which  transfers  it  to  a  high  complexity  representation  (Lucito  et  al.,  1998) 
with  a  fragment  size  distribution  between  100  and  1500  base  pairs.  Two  oligonucleotides  are  used 
in  a  pre-annealing  step  to  form  an  adaptor  complex  that  is  then  ligated  to  the  overhangs  on  the 
genomic  DNA  fragments.  By  using  a  single  primer  for  subsequent  PCR  amplification,  the 
sequence  complexities  of  multiple  primer  binding  sites  that  are  required  by  DOP-PCR  and  PEP 
techniques  are  avoided.  The  elegance  of  this  protocol  is  that  all  steps  including  DNA  isolation, 
restriction  enzyme  digestion,  primer  ligation  and  PCR  amplification  are  performed  in  the  same 
tube,  which  avoids  template  loss  and  excessive  sample  handling.  Validation  was  achieved  through 
successful  metaphase  CGH,  LOH  and  sequence  analysis  to  detect  known  DNA  copy  number 
aberrations  and  TP53  mutations  (Klein  et  al.,  1999).  In  addition,  the  breast  cancer  cell  line  MCF7 
was  analysed  using  interphase  FISH  and  the  results  confirmed  the  heterogeneity  detected  in  this 
cell  line  by  CGH  analysis.  Comparison  of  DOP-PCR  and  SCOMP  has  determined  that  SCOMP 
is  superior  to  DOP-PCR  in  eliminating  PCR-based  introduction  of  bias  when  using  FFPE 
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samples  combined  with  laser  capture  microdissection  (Stoecklein  et  al.,  2002).  The  use  of  SCOMP 
to  generate  sufficient  DNA  for  CGH  analysis  of  single  disseminated  cancer  cells  from  the  bone 
marrow  of  breast  cancer  patients  has  put  forth  compelling  evidence  that  challenges  the  widely  held 
view  that  the  precursors  of  metastasis  are  derived  from  the  most  advanced  clone  within  the 
primary  tumour  (Schmidt-Kittler  et  al.,  2003).  These  studies  highlight  the  importance  of  studying 
the  genomic  profiles  of  multiple  single  cells  within  a  tumour  to  determine  the  genetic  changes 
associated  with  progression  and  metastasis. 

A  further  adaptation  of  this  type  of  WGA  has  been  to  ligate  primer  sequences  on  to  randomly 
sheared  genomic  DNA  in  a  procedure  termed  adaptor-ligation-mediated  PCR  of  randomly 
sheared  genomic  DNA  (PRSG)  (Tanabe  et  al.,  2003).  Tanabe  and  co-workers  have  used  exon 
amplification  and  genotyping  of  307  microsatellites,  in  addition  to  array  CGH,  to  validate  this 
method  of  WGA.  The  random  shearing  of  the  DNA  may  successfully  mimic  DNA  that  has  been 
fixed  prior  to  WGA. 

As  LMP  has  not  yet  been  widely  used  for  whole  genome  analysis,  many  of  the  problems 
associated  with  the  technique,  if  they  exist,  will  not  yet  have  been  reported.  Though  initial  results 
from  our  lab  and  others  are  promising,  a  more  thorough  investigation  of  different  tissue  types  and 
fixation  conditions  is  required  before  it  will  be  possible  to  determine  relative  advantages  of  LMP 
over  other  PCR  based  approaches. 

Our  group  and  others  have  been  dissatisfied  with  the  results  produced  using  DOP-PCR  and 
PEP  (Sanchez-Cespedes  et  al.,  1998;  Hughes  et  al.,  in  press).  Imbalanced  amplification  of 
microsatellites  (Cheung  and  Nelson,  1996)  and  SNPs  (Grant  et  al.,  2002;  Paunio  et  al.,  1996)  as 
well  as  incomplete  genome  coverage  (Dean  et  al.,  2002)  and  failure  to  produce  reliable  CGH  data 
(Wells  et  al.,  1999)  have  all  been  observed.  However,  in  contrast  other  groups  have  obtained 
satisfactory  results  (Kuukasjarvi  et  al.,  1997;  Umayahara  et  al.,  2002).  This  discrepancy  is  likely  a 
result  of  tissue  handling  and  processing  as  well  as  the  down  stream  application  in  which  the 
amplified  DNA  is  being  used. 


3.  Non-PCR-based  approaches 

Though  PCR-based  WGA  has  been  used  for  many  applications,  its  shortcomings  have  been 
well  reported.  Protocols  such  as  DOP-PCR  and  PEP  amplify  DNA  in  an  exponential  fashion  and 
exponential  amplification  protocols  are  highly  susceptible  to  bias  and  have  a  tendency  to  alter  the 
information  content  of  the  DNA.  In  the  worst  case,  this  could  cause  regions  of  the  genome  to  be 
over  amplified  or  even  lost.  By  the  use  of  quantitative  real  time  PCR  Dean  et  al.  (2002)  have 
identified  an  amplification  bias  of  103-106  between  genomic  loci  in  PEP  amplified  products.  In 
addition,  PCR-based  methods  typically  generate  small  products,  usually  200  bp  to  2  kb,  which  can 
limit  some  down  stream  applications.  Furthermore,  it  has  been  acknowledged  that  DOP-PCR 
does  not  provide  complete  coverage  of  all  loci  (Paunio  et  al.,  1996),  or  copy  the  target  DNA  in  its 
entirety  (Cheung  and  Nelson,  1996)  and  can  preferentially  amplify  shorter  alleles,  Alu  sequences 
and  microsatellites  (Wells  et  al.,  1999).  All  of  these  factors  can  introduce  experimental  error  with 
respect  to  LOH  studies  and  SNP  analysis.  For  these  reasons,  many  researchers  have  switched  to 
the  use  of  non-PCR  based  linear  amplification  protocols.  These  protocols  have  a  distinct 
advantage  over  PCR  methods  as  they  preserve  sequence  representation,  do  not  introduce 
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sequence  content  bias  and  with  reference  to  strand  displacement  amplification,  provide  good 
genome  coverage  (Dean  et  al.,  2002). 

3.1.  Strand  displacement  amplification  (SDA) 

The  process  of  SDA,  or  multiple  displacement  amplification  (MDA,  Lizardi,  2000)  is  based  on 
the  rolling  circle  amplification  (RCA)  type  mechanism  by  which  circular  DNA  molecules  such  as 
plasmids  or  viruses  frequently  replicate  (Kornberg  and  Baker,  1992).  This  rolling  circle 
amplification  method  was  initially  adapted  for  the  amplification  of  large  circular  DNA  templates 
(Dean  et  al.,  2001)  and  more  recently  for  the  amplification  of  genomic  DNA  (Dean  et  al.,  2002). 
Similar  to  other  WGA  approaches,  no  previous  knowledge  of  the  target  template  is  required, 
however,  in  contrast  to  PCR-based  methods  the  lengths  of  the  DNA  products  following  SDA  are 
usually  in  excess  of  10  kb. 

SDA  uses  either  phi29  DNA  polymerase  or  a  combination  of  Bst  polymerase  and  T4  gene  32 
protein  in  conjunction  with  modified  random  primers  to  amplify  the  entire  genome  (Dean  et  al., 
2002;  Lage  et  al.,  2003).  In  contrast  to  PCR-based  methods  no  repeated  cycling  is  required. 
Instead  a  short  initial  (2-3  min)  94°  C  denaturation  is  followed  by  an  enzyme  specific  incubation 
(phi29  30°C,  Bst  50° C)  step  of  6-18  h  and  a  final  65°C  step  of  15  min  to  inactivate  the  enzyme 
(Dean  et  al.,  2002;  Lage  et  al.,  2003).  The  random  primers  first  anneal  to  the  denatured  DNA  and 
are  then  extended  by  thousands  of  bases  using  the  single  stranded  DNA  as  template.  The  5'  end 
of  any  extending  strand  is  displaced  by  another  upstream  strand  growing  in  the  same  direction 
(Fig.  1).  The  strand  displacement  reactions  generate  a  hyperbranching  mechanism  that  can 
produce  hundreds  or  thousands  of  copies  of  the  genome  in  only  a  few  hours.  As  a  result 
microgram  quantities  of  DNA  can  be  generated  from  as  little  as  5  ng  of  starting  material,  making 
it  ideally  suited  for  genomic  analysis  where  starting  DNA  is  limited. 

It  is  important  to  note  that  phi29  polymerase  has  a  greater  SDA  activity  than  the  Bst 
polymerase/T4  gene  32  protein  cocktail,  and  has  been  reported  to  introduce  a  significant  level  of 
under  and  over  representation  of  sequences  when  compared  to  the  enzyme  cocktail  (Lage  et  al., 
2003).  However,  several  groups  that  have  used  phi29  have  not  reported  any  inconsistencies  in 
their  results  (Hosono  et  al.,  2003;  Lovmar  et  al.,  2003;  Tranah  et  al.,  2003).  In  addition,  recent 
work  in  our  laboratory  that  has  compared  the  two  different  approaches  has  generated  near 
identical  results. 

To  date,  less  than  a  dozen  papers  reporting  the  use  of  SDA  for  the  study  of  human  disease  have 
been  published.  This  is  in  stark  contrast  to  the  several  dozen  reporting  the  use  of  DOP-PCR  and 
PEP.  Despite  this  disparity,  due  largely  to  the  recent  introduction  of  this  approach,  there  is  data 
covering  the  use  of  SDA  amplified  DNA  in  array  CGH  (Hughes  et  al.,  in  press;  Lage  et  al.,  2003), 
SNP  analysis  (Tranah  et  al.,  2003),  quantitative  PCR  (Hosono  et  al.,  2003;  Dean  et  al.,  2002)  and 
microsatellite  analysis  (Hosono  et  al.,  2003). 

A  study  of  47  loci  in  44  patients  by  Hosono  et  al.  (2003)  identified  an  amplification  bias  ranging 
from  0.5  to  3  fold,  which  is  in  agreement  with  previous  findings  (Dean  et  al.,  2002).  It  was  also 
determined  from  their  study  that  amplified  DNA  produced  results  indistinguishable  from 
unamplified  genomic  DNA.  Unfortunately  they  only  assayed  5  SNPs  and  10  microsatellites, 
which  is  hardly  a  comprehensive  evaluation.  Nevertheless,  the  results  are  promising.  In  addition, 
they  were  able  to  achieve  successful  SDA  directly  from  clinical  specimens,  such  as  whole  blood, 
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Fig.  1.  Strand  displacement  amplification.  Primers  anneal  at  multiple  sites  on  the  template  DNA.  The  polymerase 
initiates  primer  extension  and  new  strand  synthesis.  As  synthesis  progresses,  the  5'  end  of  the  extending  strand  is 
displaced  by  another  upstream  strand  extending  in  the  same  direction. 
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buccal  cells  and  buffy  coats,  which  eliminates  the  requirement  for  time-consuming  DNA 
extraction  processes. 

So  far,  only  three  studies  have  reported  a  direct  comparison  between  PCR-based  methods  and 
SDA.  Dean  et  al.  (2002)  using  TaqMan  analysis,  determined  an  amplification  bias  for  tested  loci 
of  approximately  3-fold  for  SDA  and  up  to  104  for  PEP  and  106  for  DOP-PCR.  In  addition,  an 
assessment  of  PEP  amplified,  SDA  amplified  and  unamplified  DNA  using  45  SNPs  established 
that  PEP  only  showed  88.7%  concordance  with  unamplified  DNA,  while  SDA  demonstrated 
99.7%  concordance  (Lovmar  et  al.,  2003).  Finally,  using  array  CGH  we  have  determined  that  the 
results  obtained  with  DOP-PCR  amplified  DNA  are  less  reliable  than  those  generated  using  SDA 
(Hughes  et  al.,  in  press). 

Unfortunately,  papers  reporting  the  use  of  SDA  have  so  far  concentrated  more  on  an 
evaluation  of  the  technique  rather  than  a  more  extensive  patient  study.  Current  work  in  our 
laboratory  suggests  that  the  fidelity  of  SDA  when  combined  with  LCM  and  array  CGH  can  allow 
for  the  detection  of  copy  number  abnormalities  from  a  small  number  of  starting  cells. 

3.2.  TV-based  linear  amplification  of  DNA  (TLAD) 

The  use  of  linear  amplification  strategies  has  been  demonstrated  by  mRNA  expression  analysis 
to  substantially  reduce  amplification  bias  (Iscove  et  al.,  2002;  Zhao  et  al.,  2002;  Wang  et  al.,  2000; 
Phillips  and  Eberwine,  1996).  However,  until  recently  this  type  of  approach  had  not  been  applied 
to  the  study  of  genomic  DNA,  due  to  the  absence  of  an  appropriate  tag  such  as  the  poly  A  tail 
present  on  mRNA  molecules.  Liu  et  al.  (2003)  has  devised  a  linear  amplification  protocol  for 
genomic  DNA,  which  will  be  applicable  to  the  amplification  of  DNA  of  varying  fragment  size  and 
quality.  This  will  be  of  particular  use  in  the  study  of  the  DNA  extracted  from  FFPE  tissue,  as  the 
quality  of  the  DNA  is  often  poor. 

T7-based  linear  amplification  is  capable  of  generating  microgram  quantities  of  genomic  DNA 
from  as  little  as  2.5  ng  of  input  DNA,  while  maintaining  the  variation  in  fragment  size  present  in 
the  starting  material.  TLAD  is  based  on  a  protocol  devised  by  Phillips  and  Eberwine  (1996)  for 
the  amplification  of  mRNA  for  use  on  cDNA  microarrays.  For  TLAD,  the  DNA  is  first  digested 
with  the  restriction  endonuclease,  Alul.  Terminal  transferase  is  then  used  to  add  polyT  tails  to  the 
3'  ends  of  the  digested  DNA  strands.  An  oligonucleotide  primer,  containing  a  5'  T7  promoter  and 
a  3'  polyA  tract,  is  annealed  to  the  genomic  fragments,  and  Klenow  is  used  for  synthesis  of  the 
second  strand.  The  reaction  products  of  this  synthesis  are  then  used  as  templates  for  the  in  vitro 
transcription  reaction.  One  of  the  major  advantages  of  TLAD  is  that  it  does  not  introduce  the 
sequence  and  length-dependent  biases  observed  in  PCR-based  amplification.  One  of  the  major 
disadvantages  is  the  requirement  for  sample  purification  following  each  step.  This  makes  the 
protocol  cumbersome  and  time-consuming  and  can  result  in  sample  loss. 

To  date  TLAD  has  only  been  validated  by  the  analysis  of  yeast  genomic  DNA  obtained 
following  chromatin  immunoprecipitation  (ChIP)  (Liu  et  al.,  2003),  so  it  is  of  interest  as  to 
whether  this  approach  will  be  appropriate  for  the  study  of  human  diseases.  In  our  laboratory 
we  have  been  able  to  successfully  use  TLAD  for  the  generation  of  DNA  for  array 
CGH  (unpublished  data).  The  CGH  profiles  generated  from  amplified  and  non-amplified 
genomic  DNA  extracted  from  a  well-characterized  cell  line  show  the  same  patterns  of  genomic 
alterations. 
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As  was  mentioned  previously  for  LMP,  more  extensive  investigation  into  the  reliability  of 
TLAD  will  be  required  before  it  will  be  generally  accepted  as  an  approach  for  faithful  WGA. 


4.  Conclusions 

Genetic  alterations  in  cancer  often  involve  loss  or  gain  of  chromosomal  regions.  Substantial 
research  effort  has  been  directed  towards  identifying  the  changes  in  a  cell  genome  that  occur 
during  the  development  and  progression  of  numerous  tumour  types.  These  analyses  have  focused 
upon  different  tumour  stages  in  an  attempt  to  determine  which  genomic  changes  occur  early  in  the 
tumour  development,  which  changes  are  accumulated  during  progression  and  which  occur  that 
may  facilitate  metastasis.  In  order  to  do  this,  it  has  been  necessary  to  analyse  DNA  from  cells  in 
all  the  different  stages.  This  has  led  to  two  problems,  firstly  if  the  DNA  is  limited,  as  is  often  the 
case  when  studying  premalignant  lesions  (Chung  et  al.,  2000;  Heinmoller  et  al.,  2003),  or  patient 
samples  (Zheng  et  al.,  2001),  only  a  restricted  number  of  assays  can  be  performed.  Secondly  when 
studying  heterogenous  tissue,  if  larger  quantities  of  “bulk  extracted”  genomic  DNA  are  used,  key 
genetic  alterations  that  could  be  of  major  interest,  may  be  masked  by  the  contaminating  normal 
cells  present  within  the  sample.  These  problems  have  led  researchers  to  use  WGA,  often  in 
conjunction  with  microdissection,  to  obtain  sufficient  amounts  of  DNA,  extracted  from  pure  cell 
populations,  for  genome  analysis.  Microdissection  can  be  used  to  obtain  a  single  cell  or  several 
thousands  cells,  however  at  best  the  yield  is  only  nanograms  of  DNA.  The  limitations  imposed  by 
insufficient  DNA  can  be  overcome  by  use  of  WGA  techniques  that  permit  apparent  unbiased 
amplification  across  the  genome,  thus  opening  up  the  possibility  for  new  scientific  approaches. 

Despite  their  differences,  WGA  techniques  all  work  on  the  premise  of  being  able  to  generate 
microgram  quantities  of  DNA,  depending  upon  the  method  used,  from  as  little  as  one  cell’s  worth 
of  DNA  (Hirose  et  al.,  2001).  Though  satisfactory  results  have  been  obtained  from  one  cell  using 
PEP,  unequal  allelic  amplification  has  been  observed  (Faulkner  and  Leigh,  1998;  Barrett  et  al., 
1995).  A  possible  explanation  for  this  is  the  fact  that  between  10%  and  20%  of  the  genetic 
material  from  a  single  nucleus  can  be  lost  during  tissue  sectioning  (Heinmoller  et  al.,  2002)  and 
subsequent  LCM.  In  addition,  fixation  of  tissues  can  introduce  sequence  variations  and  reduce 
overall  DNA  quality  (Williams  et  al.,  1999).  As  a  result  most  researchers  use  a  minimum  of 
10-1000  cells,  depending  on  tissue  type  (Heinmoller  et  al.,  2002),  fixation  method  (Dietmaier  et  al., 
1999)  and  mode  of  WGA  (Lage  et  al.,  2003;  Lovmar  et  al.,  2003;  Dean  et  al.,  2002;  Zheng 
et  al.,  2001). 

When  considering  the  use  of  WGA  it  is  important  to  take  into  account  the  entire  experimental 
process,  i.e.  tissue  accrual,  fixation,  sectioning,  staining  and  dissection.  All  of  these  factors  can 
effect  DNA  quality  and  thus  have  some  bearing  on  the  choice  of  WGA  methodology  (Ottesen 
et  al.,  2003;  Heinmoller  et  al.,  2002;  Dietmaier  et  al.,  1999).  For  instance,  SDA,  which  in 
comparison  to  other  methods  outlined  in  this  review  provides  good  genome  coverage  and  does 
not  introduce  significant  amplification  bias  (Dean  et  al.,  2002),  has  a  requirement  for  good  quality 
high  molecular  weight  DNA  usually  obtained  from  fresh  tissue.  As  a  result,  the  poor  quality  DNA 
often  extracted  from  fixed  tissue  will  not  be  suitable  for  this  approach  (Lage  et  al.,  2003).  In 
contrast  PCR-based  methods  and  TLAD  are  less  affected  by  DNA  quality  and  are  more 
applicable  to  all  DNA  sources.  However,  the  use  of  PCR-based  methods  has  its  own  inherent 
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problems.  When  studying  tissue  sections,  prior  examination  of  the  quality  of  DNA  in  the 
unrequired  regions  of  the  section  may  indicate  the  appropriate  WGA  approach  to  use. 

While  many  researchers  recognize  that  WGA  in  all  its  various  forms  has  a  range  of  limitations, 
very  few  address  them  in  terms  of  their  own  results,  likely  due  to  the  difficulty  in  assessing  the 
error  rate  that  may  be  associated  with  WGA  methods.  In  this  context  it  is  important  to  remember 
that  WGA  is  sometimes  the  only  means  to  significantly  increase  the  amount  of  data  that  can  be 
derived  from  limited  clinical  samples.  If  researchers  are  aware  of  the  possible  problems  and 
correctly  deal  with  the  potential  for  missing  data,  WGA  has  the  prospect  of  significantly 
expanding  our  knowledge  of  human  diseases  where  the  quantity  of  DNA  is  a  limiting  factor. 
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